Helping out a friend - simple question - Printable Version +- Python Forum (https://python-forum.io) +-- Forum: Python Coding (https://python-forum.io/forum-7.html) +--- Forum: Web Scraping & Web Development (https://python-forum.io/forum-13.html) +--- Thread: Helping out a friend - simple question (/thread-17134.html) |
Helping out a friend - simple question - Nector33 - Mar-29-2019 Hi, I'm scrapping a website and I don't know how to print only part for the text. from bs4 import BeautifulSoup import requests source = requests.get('http://www.website.com').text soup = BeautifulSoup(source, 'lxml') title = soup.find('div', class_='pr_title') print(title.text) sku = soup.find('div', class_='pr_infos') print(sku.text)The textx that comes out is like this : 2008 xxxxx 7xxxxx 2 xxxx, xxx-xxx" Sxxxxxx ENGINE : Diesel CAP: x,xxx lbs SKU: 1234 I just whant the ; " SKU: 1234 " Thanks RE: Helping out a friend - simple question - Larz60+ - Mar-29-2019 You probably need to drill down on the div. Change your code by commenting out lines 9 and 12 and add this line after line 12: print(BeautifulSoup.prettify(sku))And post results in output tags RE: Helping out a friend - simple question - Nector33 - Mar-30-2019 thanks RE: Helping out a friend - simple question - Larz60+ - Mar-30-2019 how about this part? Quote:And post results in output tags RE: Helping out a friend - simple question - Nector33 - Mar-30-2019 sorry I don't understand. So I subscribed to : - https://www.reddit.com/r/learnpython/ - amd continuing education here https://automatetheboringstuff.com/chapter11/ Regards RE: Helping out a friend - simple question - Larz60+ - Mar-30-2019 Let me explain:
you are getting all that test because tour div statement has children which have their own text. And because of this, multiple lines of text. If I could see what the div statement was composed of, then it would be possible to drill down to the actual 'sku' text. |