Python Forum
cleaning HTML pages using lxml and XPath
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
cleaning HTML pages using lxml and XPath
#3
In my case, HTML markup is still required. The goal is not to extract a plain text from HTML , but cleaning HTML from unnecessary elements. I need prepare html for e-book format, it support HTML and CSS styling.
Reply


Messages In This Thread
RE: cleaning HTML pages using lxml and XPath - by wenkos - Aug-25-2021, 10:54 AM

Possibly Related Threads…
Thread Author Replies Views Last Post
Bug Need Pointers/Advise for Cleaning up BS4 XPATH Data BrandonKastning 0 1,347 Mar-08-2022, 12:28 PM
Last Post: BrandonKastning
  HTML multi select HTML listbox with Flask/Python rfeyer 0 4,876 Mar-14-2021, 12:23 PM
Last Post: rfeyer
  Cleaning HTML data using Jupyter Notebook jacob1986 7 4,362 Mar-05-2021, 10:44 PM
Last Post: snippsat
  Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to write 3 Columns to MariaDB? BrandonKastning 21 7,416 Mar-23-2020, 05:51 PM
Last Post: ndc85430
  Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to loop to next HTML/new CSV Row BrandonKastning 0 2,457 Mar-22-2020, 06:10 AM
Last Post: BrandonKastning
  need help with xpath pythonprogrammer 1 2,848 Jan-18-2020, 11:28 PM
Last Post: snippsat
  non-finite value error when cleaning data yokaso 0 3,414 Dec-17-2019, 07:26 AM
Last Post: yokaso
  [Help]xpath is not working with lxml mr_byte31 3 6,444 Jul-22-2018, 04:10 PM
Last Post: stranac
  Need Tip On Cleaning My BS4 Scraped Data digitalmatic7 2 3,310 Jan-29-2018, 08:49 PM
Last Post: digitalmatic7

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020