Python Forum
What's a good practice project for learning BeautifulSoup4, which has a real use case
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
What's a good practice project for learning BeautifulSoup4, which has a real use case
#1
I want to get good at using BS4 in my scripts and want to do it by working on scripts that have some kind of real use. And I want to start simple.
Reply
#2
Chose what you want to get from a website and start coding. Start with something simple. Get the webpage title, then the header, after that all the paragraphs, all the links. Without a goal, it's hard to learn. Just reading the documentation is not gonna happen. If you have some questions or face obstacles feel free to share them here. We are willing to help.

For example, my first web scrapping script was to gather emails from a website for a bunch of people because there was not any document containing that info.

There are easy to follow tutorials here:
https://python-forum.io/Thread-Web-Scraping-part-1
https://python-forum.io/Thread-Web-scraping-part-2

You may start with them. Thanks to @snippsat.
"As they say in Mexico 'dosvidaniya'. That makes two vidaniyas."
https://freedns.afraid.org
Reply
#3
I just got a project idea inadvertently.

I found out that I can download my entire Google account history, or just parts of it. The use history is in html files inside subfolders.

These html files display just fine in a browser, but it's impossible to do anything useful with the data that way.

I looked at one of the files in a text editor and the html is HIDEOUS. I want to develop an automated way to extract the useful data.

But first I'll have a look at those tutorials you linked to see what they contain, that can help me.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Beautifulsoup4 help samuelbachorik 1 1,326 Feb-05-2022, 10:44 PM
Last Post: snippsat
  Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to write 3 Columns to MariaDB? BrandonKastning 21 6,712 Mar-23-2020, 05:51 PM
Last Post: ndc85430
  BeautifulSoup4 plugin help Lathem01 2 1,985 Feb-16-2020, 11:56 AM
Last Post: snippsat
  Cannot import BeautifulSoup4 as bs4 and BeautifulSoup4 not in same directory B5473829 1 6,335 Jul-05-2019, 06:55 PM
Last Post: snippsat
  BeautifulSoup4, How to get an HTML tag with specific class. Broadsworde 6 10,823 Nov-22-2018, 05:25 PM
Last Post: snippsat
  How to use BeautifulSoup4 with pandas series type of html data? PrateekG 4 4,854 Apr-26-2018, 07:33 AM
Last Post: PrateekG

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020