Beautiful Soup (suddenly) doesn't get full webpage html

HarleyQuin · Jul-11-2020, 02:04 PM

(Jul-11-2020, 11:52 AM)j.crater Wrote: Thank you both for answers.

@HarleyQuin
The code I ran months ago was same as I posted here, but result was not same. As stated, on my first attempt I got all the HTML contents, while this time I didn't. Also, replacing the parser for lxml parser didn't make a difference. Do you have any idea, from experience, why such difference?

Hey again,

From experience i have noticed that not using a user-agent/header makes it very easy for YouTube to immediately identify you as a web scraper and deal with your request connection differently to how a conventional user may be welcomed by the site. That is something that made a difference when i first started scraping.

e.g. i use this in my code:

import requests

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.116 Safari/537.36',
    "Content-Type": "application/x-www-form-urlencoded"}

url = "https://whatsmyua.info/"

webpage = requests.get(url, headers=headers).text
print(webpage)

Sorry if i have been of no use!

I hope you solve your issue buddy,

Regards,

Harley

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Selenium suddenly fails to find element	Pavel_47	3	6,339	Sep-04-2022, 11:06 AM Last Post: Pavel_47
	Python Obstacles \| Kung-Fu \| Full File HTML Document Scrape and Store it in MariaDB	BrandonKastning	5	2,924	Dec-29-2021, 02:26 AM Last Post: BrandonKastning
	Beautiful Soup - access a rating value in a class	KatMac	1	3,479	Apr-16-2021, 01:27 PM Last Post: snippsat
	HTML multi select HTML listbox with Flask/Python	rfeyer	0	4,654	Mar-14-2021, 12:23 PM Last Post: rfeyer
	Beginner web scraping/Beautiful Soup help	7ken8	2	2,627	Jan-28-2021, 04:26 PM Last Post: 7ken8
	Help: Beautiful Soup - Parsing HTML table	ironfelix717	2	2,703	Oct-01-2020, 02:19 PM Last Post: snippsat
	Requests-HTML vs Beautiful Soup - How to Choose?	robin73	0	3,833	Jun-23-2020, 02:53 PM Last Post: robin73
	Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to loop to next HTML/new CSV Row	BrandonKastning	0	2,379	Mar-22-2020, 06:10 AM Last Post: BrandonKastning
	looking for direction - scrappy, crawler, beautiful soup	Sly_Corn	2	2,469	Mar-17-2020, 03:17 PM Last Post: Sly_Corn
	Beautiful soup truncates results	jonesjoz	4	3,897	Mar-09-2020, 06:04 PM Last Post: jonesjoz

Beautiful Soup (suddenly) doesn't get full webpage html

User Panel Messages

Announcements