Save crawl file in a cloud bucket #396
Closed
antoineeripret
started this conversation in
Ideas
Replies: 1 comment 5 replies
-
|
Thanks again @antoineeripret !
Let me know if that works, and how it goes. Here's another interesting case study :) |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello @eliasdabbas,
It's me again with a new idea.
Context
I'm trying to create a crawl logic (including your library) through Airflow. It works great, **the issue is the amount of data created by the library in the local storage. ** I'm trying to rely as little as possible because the amount of data could be quite important in the use case I'm working on at the moment.
Idea
What I attempt to implement is the following:
By doing so, the file would be kept to a reduced size, while the output of Advertools won't be affected. The issue is that I'm not sure if there is a way of "stopping" the crawl without losing the queue.
Do you happen to have an idea?
Thank you!
Beta Was this translation helpful? Give feedback.
All reactions