Python Forum
Python Looping
#2
This is a site that doesn't want to be scraped; see its robots.txt below:
Output:
User-agent: *
Disallow: /*?list
Disallow: /pdf/*
Disallow: /thumb.php
Disallow: /basket*
Disallow: /register$
Disallow: /profile*
Disallow: /battery/*
Disallow: /oil/*
Disallow: /terms-conditions*
Disallow: /privacy-policy*
Disallow: /company*
Disallow: /rims/*
Disallow: /tires/*
Disallow: /return-of-goods*
Disallow: /*?keyword
Disallow: /spares-search.html
You should contact the company, as they may be able to give you access to their catalog so you can keep a local copy.
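
If you want to check a path against the Disallow rules quoted above before requesting it, something like the following might help. It is only a rough sketch: the function names are made up for illustration, it assumes the common wildcard convention (`*` matches any run of characters, a trailing `$` anchors the end of the path), and it ignores Allow rules and per-agent sections, neither of which appears in the quoted file.

```python
import re

# Disallow patterns copied from the robots.txt quoted in this post
# (they all belong to the "User-agent: *" section).
DISALLOW_PATTERNS = [
    "/*?list", "/pdf/*", "/thumb.php", "/basket*", "/register$", "/profile*",
    "/battery/*", "/oil/*", "/terms-conditions*", "/privacy-policy*",
    "/company*", "/rims/*", "/tires/*", "/return-of-goods*", "/*?keyword",
    "/spares-search.html",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if the path (plus query string) matches any Disallow rule."""
    # re.match anchors at the start, so each rule acts as a prefix match.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW_PATTERNS)


if __name__ == "__main__":
    # Hypothetical paths, used only to exercise the rules.
    for path in ["/tires/some-model", "/register", "/register/new", "/index.html"]:
        print(path, "->", "disallowed" if is_disallowed(path) else "not listed")
```

A path that comes back as "not listed" only means this particular file does not name it; the site's terms of use may still forbid scraping, which is another reason to ask the company first.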


Messages In This Thread
Python Looping - by dnetvaggos - Dec-07-2018, 09:51 PM
RE: Python Looping - by Larz60+ - Dec-08-2018, 12:53 AM
RE: Python Looping - by dnetvaggos - Dec-08-2018, 08:09 AM
RE: Python Looping - by Larz60+ - Dec-08-2018, 11:20 AM

