Python Forum
Convert a PDF files to HTML files
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Convert a PDF files to HTML files
#1
Hi everyone,

I am a new user and I started to learn Python to do something in particulary.

I would like to convert PDF Files automatically to HTML files but I don't know how to do this.

I tried to use somes library but to be honest I have lot of difficult to understand and to be capable to reproduce the example.

So if someone can help me to convert thoose PDF Files in HTML document, it would be nice of you.

Thank you for you're helping.
Reply
#2
PDF is a great way to view data, but one of the most difficult for extracting data.
There are some packages that make the conversion easier (sometimes, depending on PDF format)
One such for PDF to html is pdftotree: https://pypi.org/project/pdftotree/

There are others (you will have to reasearch to find suitable match) https://pypi.org/search/?q=%22PDF+to+HTML%22&o=
Reply
#3
(Oct-20-2020, 10:08 AM)Larz60+ Wrote: PDF is a great way to view data, but one of the most difficult for extracting data.
There are some packages that make the conversion easier (sometimes, depending on PDF format)
One such for PDF to html is pdftotree: https://pypi.org/project/pdftotree/

There are others (you will have to reasearch to find suitable match) https://pypi.org/search/?q=%22PDF+to+HTML%22&o=

Thank you for your answer.

I am going to check this package.
I hope there will be a very detailed example because I am a beginner in python programming
Reply
#4
I'm sorry but I can't understand how it works.

I am trying to use this package but I don't know what I need to install. I am using Pycharm with Python 3.8
Reply
#5
Quote:I am trying to use this package but I don't know what I need to install
pip install pdftotree

documentation and examples: https://github.com/HazyResearch/pdftotree
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  [SOLVED] Loop through directories and files one level down? Winfried 3 184 Apr-28-2024, 02:31 PM
Last Post: Gribouillis
  Loop through all files in a directory? Winfried 10 464 Apr-23-2024, 07:38 PM
Last Post: FortuneCoins
Question Right way to open files with different encodings? Winfried 2 269 Apr-23-2024, 05:50 PM
Last Post: snippsat
  Open files in an existing window instead of new Kostov 2 348 Apr-13-2024, 07:22 AM
Last Post: Kostov
  Using zipfile module - finding folders not files darter1010 2 294 Apr-06-2024, 07:22 AM
Last Post: Pedroski55
  Is it possible to extract 1 or 2 bits of data from MS project files? cubangt 8 1,087 Feb-16-2024, 12:02 AM
Last Post: deanhystad
  File loop curiously skipping files - FIXED mbk34 10 865 Feb-10-2024, 07:08 AM
Last Post: buran
  Copy Paste excel files based on the first letters of the file name Viento 2 466 Feb-07-2024, 12:24 PM
Last Post: Viento
  Class test : good way to split methods into several files paul18fr 4 505 Jan-30-2024, 11:46 AM
Last Post: Pedroski55
  uploading files from a ubuntu local directory to Minio storage container dchilambo 0 479 Dec-22-2023, 07:17 AM
Last Post: dchilambo

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020