Python Forum
Deployed Spider on Heroku: How do I email downloaded files?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Deployed Spider on Heroku: How do I email downloaded files?
#1
I am new to web-scraping and deployment of spiders.

1) Basically, I am in the midst of creating a spider to scrape some items and download Excel files from a website

2) I want to schedule the spider on Heroku to send an email with the downloaded files. How do I do so for free?

From https://github.com/yashashreesuresh/amaz...ce_tracker, I know we can use smtplib to let the spider send email.

However, for the downloaded file, where will the file be located in if we deploy the spider from Heroku?

Thank you
Reply
#2
First, are you sure that this is something you should be doing on Heroku? I mean, do their terms of use say you shouldn't?

Note that the file system in your app's dyno is ephemeral - see this for more info. If you want more permanent storage, look at a database or some cloud storage service like Amazon S3, Google Cloud Storage, etc.
Reply
#3
(Mar-24-2022, 06:31 AM)ndc85430 Wrote: First, are you sure that this is something you should be doing on Heroku? I mean, do their terms of use say you shouldn't?

Note that the file system in your app's dyno is ephemeral - see this for more info. If you want more permanent storage, look at a database or some cloud storage service like Amazon S3, Google Cloud Storage, etc.

Hi

Thanks for the reply. I know the file system is ephemeral. So if the spider click some button on the website to trigger a file download, where will the file be saved? I know I can create code in the spider to send an email using SMTP_SSL, for example, but I will need the file path to attach it to the email.

Thanks
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
Star Leapcell: The Python-Friendly Alternative to Heroku + Airtable Hybrid IssacChan 1 471 Feb-01-2024, 06:00 AM
Last Post: Athi
  Heroku Error H10 jamesaarr 1 2,033 Oct-21-2021, 03:43 PM
Last Post: jamesaarr
  Importing Postgres Heroku from AWS S3 Drone4four 0 1,809 May-27-2021, 01:09 PM
Last Post: Drone4four
  Django project deployed to Heroku: Postgres security Drone4four 0 1,942 Mar-26-2021, 10:17 AM
Last Post: Drone4four
  Django send email - email form Remek953 2 2,306 Sep-18-2020, 07:07 AM
Last Post: Remek953
  Parse data from downloaded html nikos48 7 3,640 Jan-26-2020, 03:35 PM
Last Post: nikos48
  Why doesn't my spider find body text? sigalizer 5 4,364 Oct-30-2019, 11:35 PM
Last Post: sigalizer
  Scrape script when deployed on heroku not giving expected outcomes alok001 1 3,258 Oct-19-2019, 12:39 PM
Last Post: alok001
  flask app to save images locally when deployed on heroku not working Prince_Bhatia 1 5,276 Feb-20-2019, 11:59 PM
Last Post: snippsat
  Deploy flask app on Heroku Prince_Bhatia 0 4,040 Feb-20-2019, 09:05 AM
Last Post: Prince_Bhatia

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020