Python Forum
Web scraping for new additions/modified website?
Yes, you can, for example, save the state of what you want to check to disk.
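from bs4 import BeautifulSoup

# Sample HTML standing in for a downloaded page.
html = '''\
<html>
<body>
  <h1>The img element</h1>
  <img src="img_girl.jpg" alt="Girl in a jacket" width="500" height="600">
</body>
</html>'''

soup = BeautifulSoup(html, 'lxml')
# Extract the value to track: the alt text of the first <img>.
img_tag = soup.select_one('img')['alt'] # Girl in a jacket
# Save it to disk as the reference state for later comparisons.
with open('img_tag.txt', 'w') as f_out:
    f_out.write(img_tag)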
Then, on a later run, compare the current value against the saved one like this.
from bs4 import BeautifulSoup

# The same page at a later time; the alt text has changed.
html = '''\
<html>
<body>
  <h1>The img element</h1>
  <img src="img_girl.jpg" alt="Apple in snow" width="500" height="600">
</body>
</html>'''

soup = BeautifulSoup(html, 'lxml')
img_tag = soup.select_one('img')['alt']
# Read the previously saved value, then compare it with the current one.
with open('img_tag.txt') as f:
    old_tag = f.read()
if old_tag == img_tag:
    print('No update')
else:
    print(f'New image update: <{img_tag}>')
Output:
New image update: <Apple in snow>
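The check above never rewrites img_tag.txt, so the same change would be reported on every run, and it fails if the file does not exist yet. A minimal sketch of one way to handle both cases, assuming the tracked value has already been extracted as a string; the function name check_for_update and its parameters are hypothetical, not part of the original post:

```python
from pathlib import Path


def check_for_update(current_value: str, state_file: str = "img_tag.txt") -> bool:
    """Return True if current_value differs from the value saved in state_file.

    The saved value is refreshed on every change, so the next run compares
    against the latest state. A missing file (first run) counts as a change.
    """
    path = Path(state_file)
    # On the first run there is nothing saved yet.
    old_value = path.read_text(encoding="utf-8") if path.exists() else None
    if old_value == current_value:
        return False
    # Persist the new state so the same change is not reported twice.
    path.write_text(current_value, encoding="utf-8")
    return True
```

Calling check_for_update(img_tag) after parsing the page would then replace the manual read-and-compare step.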
You can run this manually or automate it on a schedule, e.g. with the schedule library (Python job scheduling for humans).
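For the automated variant, here is a rough sketch that uses only the standard library as a stand-in for a dedicated scheduling package. The name run_every and its parameters are assumptions made for illustration; job would be whatever function performs the scrape and comparison.

```python
import time
from typing import Callable, Optional


def run_every(job: Callable[[], None], interval_seconds: float,
              max_runs: Optional[int] = None) -> int:
    """Call job repeatedly, sleeping interval_seconds between calls.

    max_runs limits the number of calls (None means run until interrupted).
    Returns the number of completed runs.
    """
    runs = 0
    while max_runs is None or runs < max_runs:
        job()
        runs += 1
        # Skip the final sleep once the run limit has been reached.
        if max_runs is None or runs < max_runs:
            time.sleep(interval_seconds)
    return runs
```

For example, run_every(check_page, 3600) would call a hypothetical check_page function once an hour until the process is stopped.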


Messages In This Thread
RE: web scraping for new additions/modifed website? - by snippsat - Apr-11-2022, 08:57 PM
