Python Forum
Beautiful Soup - Delete All HTML - Except Specific Classes
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Beautiful Soup - Delete All HTML - Except Specific Classes
#1
Hi all,

I have been looking everyhwere for this concept.

I wanted to delete all html except for the classes. I have listed
The idea is below. The code is not correct

html = '''\
<h2 class="1">section1</h2>
<p class="2">article1</p>
<p>article2</p>
<p class="3">article3</p>
<h1> Lorem Ipsum</h1>
<p> 3 Lorem ipsum dolor </p>",'lxml')
'''

soup = BeautifulSoup(html, 'lxml')


for tag in soup():
   if not class in ["1", "2"]:
        tag.decompose()
print(soup)
I cant find any code samples to show me this idea

Basically delete all html except for those classes listed in the list

Result:
Everything Deleted except:
<h2 class="1">section1</h2>
<p class="2">article1</p>
<p class="3">article3</p>

please do advise thank you
Reply


Messages In This Thread
Beautiful Soup - Delete All HTML - Except Specific Classes - by dj99 - Jul-12-2018, 03:28 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Python Obstacles | Karate | HTML/Scrape Specific Tag and Store it in MariaDB BrandonKastning 8 3,161 Nov-22-2021, 01:38 AM
Last Post: BrandonKastning
  Beautiful Soup - access a rating value in a class KatMac 1 3,457 Apr-16-2021, 01:27 PM
Last Post: snippsat
  HTML multi select HTML listbox with Flask/Python rfeyer 0 4,617 Mar-14-2021, 12:23 PM
Last Post: rfeyer
  *Beginner* web scraping/Beautiful Soup help 7ken8 2 2,601 Jan-28-2021, 04:26 PM
Last Post: 7ken8
  Beautiful Soap can't find a specific section on the page Pavel_47 1 2,419 Jan-18-2021, 02:18 PM
Last Post: snippsat
  Help: Beautiful Soup - Parsing HTML table ironfelix717 2 2,669 Oct-01-2020, 02:19 PM
Last Post: snippsat
  Beautiful Soup (suddenly) doesn't get full webpage html j.crater 8 16,797 Jul-11-2020, 04:31 PM
Last Post: j.crater
  Requests-HTML vs Beautiful Soup - How to Choose? robin73 0 3,810 Jun-23-2020, 02:53 PM
Last Post: robin73
  Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to loop to next HTML/new CSV Row BrandonKastning 0 2,357 Mar-22-2020, 06:10 AM
Last Post: BrandonKastning
  looking for direction - scrappy, crawler, beautiful soup Sly_Corn 2 2,446 Mar-17-2020, 03:17 PM
Last Post: Sly_Corn

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020