Aug-25-2021, 10:54 AM
In my case, HTML markup is still required. The goal is not to extract a plain text from HTML , but cleaning HTML from unnecessary elements. I need prepare html for e-book format, it support HTML and CSS styling.
cleaning HTML pages using lxml and XPath
|
|
Messages In This Thread |
cleaning HTML pages using lxml and XPath - by wenkos - Aug-24-2021, 03:44 PM
RE: cleaning HTML pages using lxml and XPath - by snippsat - Aug-24-2021, 05:51 PM
RE: cleaning HTML pages using lxml and XPath - by wenkos - Aug-25-2021, 10:54 AM
|