
persistStorage and AWS #2464

Closed Answered by janbuchar
tsrdatatech asked this question in Q&A

As of now, there is no native S3 storage adapter in Crawlee. Off the top of my head, you could mount EFS and keep using the default MemoryStorage, which is backed by a local directory, so pointing that directory at the EFS mount gives you persistence.

Or, if you insist on using S3, you could download the bucket's contents into your storage directory right before calling crawler.run() and upload the directory again after the crawl finishes - surely there is some npm package that can do that 🙂.
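
A minimal sketch of that pull / run / push idea, in case it helps. This is not a Crawlee API: it assumes the AWS CLI is installed with credentials configured and shells out to `aws s3 sync` instead of using an npm package. The names `withS3Storage`, `syncArgs`, and `CommandRunner` are made up for illustration, and the bucket URI is a placeholder.

```typescript
import { execFile } from "node:child_process";
import { promisify } from "node:util";

const execFileAsync = promisify(execFile);

// Function that runs an external command; injectable so the flow can be tested
// without touching AWS. (Hypothetical helper type, not part of Crawlee.)
type CommandRunner = (cmd: string, args: string[]) => Promise<void>;

// Default runner: calls the AWS CLI (assumed to be on PATH and configured).
const runAwsCli: CommandRunner = async (cmd, args) => {
    await execFileAsync(cmd, args);
};

// Arguments for `aws s3 sync <from> <to>`.
function syncArgs(from: string, to: string): string[] {
    return ["s3", "sync", from, to];
}

// Pull the bucket into the local storage directory, run the crawl,
// then push the directory back, even if the crawl throws.
async function withS3Storage<T>(
    s3Uri: string,
    localDir: string,
    crawl: () => Promise<T>,
    run: CommandRunner = runAwsCli,
): Promise<T> {
    await run("aws", syncArgs(s3Uri, localDir));
    try {
        return await crawl();
    } finally {
        await run("aws", syncArgs(localDir, s3Uri));
    }
}
```

Usage would look roughly like `await withS3Storage("s3://my-bucket/crawlee-storage", "./storage", () => crawler.run(startUrls))`, where `./storage` is Crawlee's default storage directory (or whatever `CRAWLEE_STORAGE_DIR` points to). Note that `aws s3 sync` does not delete files on the destination unless you add `--delete`, so stale objects may linger in the bucket.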

Answer selected by tsrdatatech