Python Forum
How to clean html content using BeautifulSoup in Python 3.6?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to clean html content using BeautifulSoup in Python 3.6?
#6
(Apr-27-2018, 07:05 AM)PrateekG Wrote: what if I don't want include <img>, <iframe> tags?
The have to do second cleaning,to clean tags inside other tag.
Now it start to get complex,usually this is the other way around.
Which mean that you parse date you do want,an not like now try filter out all data that's not wanted Doh
Reply


Messages In This Thread
RE: How to clean html content using BeautifulSoup in Python 3.6? - by snippsat - Apr-27-2018, 01:14 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Extracting content from a website using Python? SandraYokum 2 408 May-27-2024, 03:30 AM
Last Post: Davidleo
  Strange ModuleNotFound Error on BeautifulSoup for Python 3.11 Gaberson19 1 1,161 Jul-13-2023, 10:38 AM
Last Post: Gaurav_Kumar
  Retrieve website content using Python? Vadanane 1 1,383 Jan-16-2023, 09:55 AM
Last Post: Axel_Erfurt
  Getting a URL from Amazon using requests-html, or beautifulsoup aaander 1 1,785 Nov-06-2022, 10:59 PM
Last Post: snippsat
  requests-html + Beautifulsoup klaarnou 0 2,504 Mar-21-2022, 05:31 PM
Last Post: klaarnou
  Python Obstacles | Krav Maga | Wiki Scraped Content [Column Copy] BrandonKastning 4 2,343 Jan-03-2022, 06:59 AM
Last Post: BrandonKastning
  Python Obstacles | Kapap | Wiki Scraped Content [Column Nulling] BrandonKastning 2 1,817 Jan-03-2022, 04:26 AM
Last Post: BrandonKastning
  Python BeautifulSoup gives unusable text? dggo666 0 1,476 Oct-29-2021, 05:12 AM
Last Post: dggo666
  Python Web Scraping can not getting all HTML content yqqwe123 0 1,706 Aug-02-2021, 08:56 AM
Last Post: yqqwe123
  Python BeautifulSoup IndexError: list index out of range rhat398 1 6,360 May-28-2021, 09:09 PM
Last Post: Daring_T

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020