how to print out all the link <a> under each h2 section using beautifulsoup

***metulburr*** · (This post was last modified: Feb-02-2018, 02:51 AM by metulburr.)

you've got it most of the way. your find_all h2s are including the h2 tag, so you need to find the a tag after that h2.a or h2.find('a') and then you need to get the text and strip all whitespace from the outer edges of it.

from bs4 import BeautifulSoup

html = '''
<div>
<h2>
<a href='xxxx'>
The content I want to print out 1
</a>
</h2>
<div>

<div>
<h2>
<a href='xxxx'>
The content I want to print out 2
</a>
</h2>
<div>
'''

soup = BeautifulSoup(html,"html.parser")
h2s = soup.find_all("h2")
for h2 in h2s:
    print(h2.a.text.strip())

Output:The content I want to print out 1
The content I want to print out 2

if you wanted to get the actual link

    print(h2.a['href'])

Output:xxxx
xxxx

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	web scraping extract particular Div section	AjayBachu	7	10,378	May-12-2020, 03:24 PM Last Post: AjayBachu
	Web scraping read particular section	AjayBachu	4	3,074	May-08-2020, 07:33 AM Last Post: AjayBachu
	get link and link text from table	metulburr	5	6,284	Jun-13-2019, 07:50 PM Last Post: snippsat
	Monitor a section of a webpage for changes	yeto	1	3,150	Dec-05-2017, 08:09 PM Last Post: nilamo

how to print out all the link <a> under each h2 section using beautifulsoup

User Panel Messages

Announcements