Jul-12-2018, 04:09 PM
.select()
lets you do css selectors, so you can just use that: >>> import bs4 >>> html = ''' ... <h2 class="1">section1</h2> ... <p class="2">article1</p> ... <p>article2</p> ... <p class="3">article3</p> ... <h1> Lorem Ipsum</h1> ... <p> 3 Lorem ipsum dolor </p>",'lxml') ... ''' >>> >>> soup = bs4.BeautifulSoup(html, 'html.parser') >>> soup.select(".1, .2") [<h2 class="1">section1</h2>, <p class="2">article1</p>]