Jul-12-2018, 04:00 PM
You can use extract() and set class_ to True, which matches any tag that has a class attribute.

from bs4 import BeautifulSoup

html = '''\
<h2 class="1">section1</h2>
<p class="2">article1</p>
<p>article2</p>
<p class="3">article3</p>
<h1> Lorem Ipsum</h1>
<p> 3 Lorem ipsum dolor </p>
'''

soup = BeautifulSoup(html, 'lxml')
# class_=True selects only the h2/p tags that carry a class attribute
for tag in soup.find_all(['h2', 'p'], class_=True):
    print(tag.extract())
Output:
<h2 class="1">section1</h2>
<p class="2">article1</p>
<p class="3">article3</p>
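
Keep in mind that extract() also removes each tag from the tree, so after the loop the soup only holds the tags that had no class. A minimal sketch of that side effect, assuming a shortened version of the HTML and 'html.parser' instead of 'lxml' so it runs without the extra dependency (the names removed and remaining are only for illustration):

```python
from bs4 import BeautifulSoup

html = '''\
<h2 class="1">section1</h2>
<p class="2">article1</p>
<p>article2</p>
<p class="3">article3</p>
'''

# Assumption: the built-in parser is used here in place of lxml.
soup = BeautifulSoup(html, 'html.parser')

# extract() detaches the tag from the tree and returns it.
removed = [tag.extract() for tag in soup.find_all(['h2', 'p'], class_=True)]

# Only the tag without a class attribute is still in the soup.
remaining = [str(tag) for tag in soup.find_all(['h2', 'p'])]
print(remaining)  # ['<p>article2</p>']
```

If the tags should stay in the document, printing tag instead of tag.extract() gives the same output without modifying the soup.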