Python Forum
export PDF to excel or csv - Printable Version

+- Python Forum (https://python-forum.io)
+-- Forum: Python Coding (https://python-forum.io/forum-7.html)
+--- Forum: General Coding Help (https://python-forum.io/forum-8.html)
+--- Thread: export PDF to excel or csv (/thread-15495.html)



export PDF to excel or csv - prem2179 - Jan-19-2019

I want to extract tables from PDF and extract it to CSV. i have tried:

1. tabula: It gives error: java` command is not found from this Python process. Please ensure Java is installed and PATH is set for `java
Have installed Java and set the environment also. but not working.
2. camelot: It is also showing error:  module 'camelot' has no attribute 'read_pdf'
3. import extract_table_data: Not giving correct results.. it shows: [[['', 'EAN', 'DESC'],
  ['', '1234 ABC', ''],
  ['', '2345 PQR', ''],
  ['', '5678 LMN', ''],
  ['', '6790 XYZ PQ', ''],
  ['', '3423 AABB', ''],
  ['', '1212 CCDD', '']]]



RE: export PDF to excel or csv - Larz60+ - Jan-19-2019

you can use reportlab for this see: https://www.blog.pythonlibrary.org/2018/05/03/exporting-data-from-pdfs-with-pytho