Python Forum
extract data inside a table from a .doc file
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
extract data inside a table from a .doc file
#9
here is example that reads the test file if converted to docx
from docx import Document

document = Document('test.docx')
tbl = document.tables[0]
for rw in tbl.rows:
    if rw.cells[0].text.startswith('CONCLUSIONI:'):
        print(rw.cells[0].text)
Output:
CONCLUSIONI: 16. Quadro microcircolatorio con segni aspecifici ma evidenti compa tibili con connettivopatia 22. Quadro microcircolatorio compatibile con Raynaud 42. Si consiglia visita reumatologica ed eseguire i seguenti esami di laboratori o: esame emocromocitometrico completo con formula leucocitaria, VES, PCR, Reuma Te st, C3, C4, ANA, ANCA,ASMA, ENA Profile, Sideremia, ferritinemia. >>>
Reply


Messages In This Thread
RE: extract data inside a table from a .doc file - by buran - Feb-28-2018, 02:07 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Why can't it extract the data from .txt well? Melcu54 4 1,902 Dec-12-2024, 07:36 PM
Last Post: Melcu54
  JSON File - extract only the data in a nested array for CSV file shwfgd 2 1,205 Aug-26-2024, 10:14 PM
Last Post: shwfgd
  Python script to extract data from API to database melpys 0 1,104 Aug-12-2024, 05:53 PM
Last Post: melpys
  Extract and rename a file from an Archive tester_V 4 4,162 Jul-08-2024, 07:54 AM
Last Post: tester_V
  Is it possible to extract 1 or 2 bits of data from MS project files? cubangt 8 4,128 Feb-16-2024, 12:02 AM
Last Post: deanhystad
  Navigating file directories and paths inside Jupyter Notebook Mark17 5 9,972 Oct-29-2023, 12:40 PM
Last Post: Mark17
  Using pyodbc&pandas to load a Table data to df tester_V 3 3,143 Sep-09-2023, 08:55 PM
Last Post: tester_V
  Extract file only (without a directory it is in) from ZIPIP tester_V 1 4,495 Jan-23-2023, 04:56 AM
Last Post: deanhystad
  extract table from multiple pages sshree43 8 10,119 Dec-12-2022, 10:34 AM
Last Post: arvin
  Reading All The RAW Data Inside a PDF NBAComputerMan 4 3,380 Nov-30-2022, 10:54 PM
Last Post: Larz60+

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020