Python Forum

Scraping all website text using Python
I am very new to Python (so apologies in advance for any stupid questions). I have an Excel sheet that lists a number of companies, each with a unique company identifier and the corresponding URL next to it.

What I would like to do is open each URL and save all of the website text (the complete text of the site's first page) to a separate .txt file for each company. The file name should be the unique identifier from the Excel sheet.

Has anyone done something similar before, or could someone help me with the code for this task?

That would be great!!
I suggest that you go through snippsat's web scraping tutorials here:
web scraping part 1
web scraping part 2
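
To give a rough idea of what the task in the question could look like, here is a minimal sketch using only the standard library. It is not taken from the tutorials above. It assumes the Excel sheet has been saved as a CSV file with two columns named id and url; those column names, and the function names html_to_text, fetch_html and save_all, are made up for this example. Pages that build their content with JavaScript will come back mostly empty with this approach.

```python
import csv
from html.parser import HTMLParser
from pathlib import Path
from urllib.request import Request, urlopen


class TextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style blocks.
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Return the text of an HTML document, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)


def fetch_html(url, timeout=10):
    """Download one page and decode it to a string."""
    # Some sites reject requests without a browser-like User-Agent.
    req = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(req, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def save_all(csv_path, out_dir, fetch=fetch_html):
    """Write <id>.txt into out_dir for every row of the CSV file.

    Assumes the CSV has the (hypothetical) columns "id" and "url".
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with open(csv_path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            try:
                text = html_to_text(fetch(row["url"]))
            except Exception as exc:
                # One unreachable site should not stop the whole run.
                print(f"skipping {row['id']}: {exc}")
                continue
            (out / f"{row['id']}.txt").write_text(text, encoding="utf-8")
```

You would then call something like save_all("companies.csv", "pages"), where both names are placeholders. If an identifier contains characters that are not allowed in file names (such as / or :), it would need to be cleaned up first. Third-party packages such as pandas (to read the .xlsx file directly), Requests and BeautifulSoup make this more convenient once you have them installed.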