Python Forum
extracting data/strings from Word doc
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
extracting data/strings from Word doc
#1
Thumbs Down 
Hi

I've been looking at tutorials on how to extract data and strings from word documents and in some cases also the data and strings from a table in a word document.

However, the tutorials always show a simple word document with a simple layout. I've been trying to learn this because I need to do this for 1000 of word documents that are generated from a webpage.

Unfortunately, the document that is generated has tables inside of other tables and this is where the "standard" tutorials don't work anymore.

I've tried using Docx and can see the loaded word documents in pycharm, but I can't seem to get the strings and int from the tables inside other tables.

I've posted a link to the word document on google drive if anyone wants to see the terrible generated word document Tongue

Word document

Hope someone can help me.

Best Regards Mikkel
Reply
#2
Look here: https://pypi.org/search/?q=msword&o=
I can't recommend any of these as I have not used them, but you should be able to find something useful.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Trying to understand strings and lists of strings Konstantin23 2 761 Aug-06-2023, 11:42 AM
Last Post: deanhystad
  Extracting Data into Columns using pdfplumber arvin 17 5,559 Dec-17-2022, 11:59 AM
Last Post: arvin
  Splitting strings in list of strings jesse68 3 1,766 Mar-02-2022, 05:15 PM
Last Post: DeaD_EyE
  Extracting Data from tables DataExtrator 0 1,145 Nov-02-2021, 12:24 PM
Last Post: DataExtrator
Question Problem: Check if a list contains a word and then continue with the next word Mangono 2 2,495 Aug-12-2021, 04:25 PM
Last Post: palladium
  extracting data ajitnayak1987 1 1,535 Jul-29-2021, 06:13 AM
Last Post: bowlofred
  Extracting and printing data ajitnayak1987 0 1,410 Jul-28-2021, 09:30 AM
Last Post: ajitnayak1987
  Extracting unique pairs from a data set based on another value rybina 2 2,307 Feb-12-2021, 08:36 AM
Last Post: rybina
  Extracting data without showing dtype, name etc. tgottsc1 3 4,402 Jan-10-2021, 02:15 PM
Last Post: buran
  Extracting data from a website tgottsc1 2 2,271 Jan-09-2021, 08:14 PM
Last Post: tgottsc1

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020