Beautifulsoup Scraping

***snippsat*** · (This post was last modified: Jun-21-2019, 11:48 AM by snippsat.)

(Jun-21-2019, 10:26 AM)PolskaYBZ Wrote: I have added to the weblink "&page=(page)", but this does not seem to work as it only scrape one page.

The link is a little wrong and can use f-string to insert into link string.

for page in range (1,4):
    link = f"https://www.otodom.pl/wynajem/mieszkanie/krakow/?search%5Bdescription%5D=1&search%5Bsubregion_id%5D=410&search%5Bcity_id%5D=38&page={page}"
    print(link)

Now can you test output and see that eg link 2 switch to page 2.

Quote:I would like to retrieve the data-id from the below HTML code. But this is a little tricky as it seems it is not part from the class and therefore I am not able to retrieve it like hp

data-id is an attribute in link a,so can use attrs to get it.

from bs4 import BeautifulSoup

html = '''\
<a class="button-observed observe-link favourites-button observed-text svg-heart add-to-favourites" data-statkey="ad.observed.list" rel="nofollow" data-id="45264798" href="#" title="Obserwuj">
<div class="observed-text-container" style="display: flex;">'''

soup = BeautifulSoup(html, 'lxml')
link_a = soup.find('a')

>>> link_a.attrs
{'class': ['button-observed',
           'observe-link',
           'favourites-button',
           'observed-text',
           'svg-heart',
           'add-to-favourites'],
 'data-id': '45264798',
 'data-statkey': 'ad.observed.list',
 'href': '#',
 'rel': ['nofollow'],
 'title': 'Obserwuj'}

>>> link_a.attrs['data-id']
'45264798'

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Scraping based on years BeautifulSoup	rhat398	0	1,849	May-22-2021, 07:20 PM Last Post: rhat398
	Combining selenium and beautifulsoup for web scraping	sumandas89	3	11,849	Jan-30-2018, 02:14 PM Last Post: metulburr
	Scraping with BeautifulSoup	Prince_Bhatia	8	7,223	Sep-07-2017, 06:34 PM Last Post: Prince_Bhatia

Beautifulsoup Scraping

User Panel Messages

Announcements