Python Forum
extract data inside a table from a .doc file
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
extract data inside a table from a .doc file
#3
no, i am not asking to have the job done i would do it by myself

i was asking to suggestion because i am unable to even open correctly a single file

so far i tried to
1) convert it using antiword -> didn't work
2) open with textract, i discovered that it used antiword so -> didn't work
3) tested to convert the file with soffice --convert-to odt *.doc better then before but -> didn't work
4) tested about another 3/4 method found on google but any worked

but now i think i found the problem, it is that i need to take some data from the heading
and it is treated as a "outside the margin" in word file so any of this method "see" it

if someone wants to try something: test file
Reply


Messages In This Thread
RE: extract data inside a table from a .doc file - by aster - Feb-28-2018, 12:58 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Why can't it extract the data from .txt well? Melcu54 4 1,914 Dec-12-2024, 07:36 PM
Last Post: Melcu54
  JSON File - extract only the data in a nested array for CSV file shwfgd 2 1,217 Aug-26-2024, 10:14 PM
Last Post: shwfgd
  Python script to extract data from API to database melpys 0 1,111 Aug-12-2024, 05:53 PM
Last Post: melpys
  Extract and rename a file from an Archive tester_V 4 4,183 Jul-08-2024, 07:54 AM
Last Post: tester_V
  Is it possible to extract 1 or 2 bits of data from MS project files? cubangt 8 4,166 Feb-16-2024, 12:02 AM
Last Post: deanhystad
  Navigating file directories and paths inside Jupyter Notebook Mark17 5 10,052 Oct-29-2023, 12:40 PM
Last Post: Mark17
  Using pyodbc&pandas to load a Table data to df tester_V 3 3,172 Sep-09-2023, 08:55 PM
Last Post: tester_V
  Extract file only (without a directory it is in) from ZIPIP tester_V 1 4,524 Jan-23-2023, 04:56 AM
Last Post: deanhystad
  extract table from multiple pages sshree43 8 10,152 Dec-12-2022, 10:34 AM
Last Post: arvin
  Reading All The RAW Data Inside a PDF NBAComputerMan 4 3,389 Nov-30-2022, 10:54 PM
Last Post: Larz60+

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020