Sep-13-2018, 05:16 PM
In general, there are two common approaches to scraping a website that uses JavaScript:
- Opening the page using an actual web browser (e.g. using selenium)
- Figuring out what the page is doing, and emulating that in your code
The former requires less work, as you just load the website in a browser and deal with the resulting HTML.
The latter requires some digging, but it usually results in more efficient code, and it doesn't require you to run a full browser.
For this particular page, I used my browser's dev tools to find an XHR request that loads the data.
Knowing where the data comes from makes getting the information as simple as making a single request (using requests):
>>> import requests
>>> r = requests.get(
...     'http://greyhoundbet.racingpost.com/card/blocks.sd?race_id=1638926&r_date=2018-09-13&blocks=form',
...     headers={
...         'User-Agent': 'Mozilla/5.0',
...     }
... )
>>> data = r.json()
>>> [dog['dogName'] for dog in data['form']['dogs']]
['Lobors Ferrett', 'Cairns Cilla', 'Power Diva', 'Artic Image', 'Millbank Gem', 'Market Centre']
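If you want to reuse this for other races, the same request can be wrapped in a small helper that builds the query parameters instead of hard-coding the URL. This is just a sketch: the function name `fetch_form` and the optional `session` parameter are my own, not anything the site defines, and only the endpoint and parameter names come from the dev-tools request above.

```python
import requests

# Endpoint discovered via the browser's dev tools (XHR request)
API_URL = 'http://greyhoundbet.racingpost.com/card/blocks.sd'

def fetch_form(race_id, r_date, session=None):
    """Fetch the 'form' block for one race and return the dog names.

    race_id: numeric race identifier, e.g. 1638926
    r_date:  race date as 'YYYY-MM-DD', e.g. '2018-09-13'
    session: optional requests.Session (handy for connection reuse or testing)
    """
    s = session or requests.Session()
    r = s.get(
        API_URL,
        params={'race_id': race_id, 'r_date': r_date, 'blocks': 'form'},
        headers={'User-Agent': 'Mozilla/5.0'},
    )
    r.raise_for_status()
    data = r.json()
    return [dog['dogName'] for dog in data['form']['dogs']]
```

Passing a `session` also makes the helper easy to test without hitting the network, since you can hand it any object with a compatible `get` method.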