Aug-30-2022, 12:57 AM
Thanks for the info!
I always use LibreOffice, except when I need to work with Python. Then I save things in Excel or Word format.
It is noticeable that Excel or Word documents are smaller than their Libre Office counterparts.
Once, I made a typo while saving and ended up with a document named test.docx_
Try that, then look at the file: a .docx is really a zip archive. Open it.
Inside, word/document.xml contains the text. You just need a parser to pull the text out of the <w:t> elements, for example:
<w:t>Hello me.</w:t>
Any other stuff must be in there as well; it is just a matter of parsing the right XML tags.
Python has an XML parser in its standard library: xml.etree.ElementTree.
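Here is a rough sketch of that idea using only the standard library (zipfile plus xml.etree.ElementTree). The function name docx_paragraphs is made up for this post, and it assumes the text lives in word/document.xml under the usual WordprocessingML namespace. It ignores headers, footnotes and formatting, so treat it as a starting point rather than a full reader.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace URI that the "w:" prefix in document.xml stands for
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_paragraphs(path):
    """Return the text of each <w:p> paragraph in a .docx as a list of strings.

    `path` can be a file name or a binary file-like object.
    """
    # A .docx is a zip archive; the main body is stored in word/document.xml
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # Word may split one sentence over several <w:t> runs, so join them
        runs = [t.text or "" for t in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return paragraphs
```

Calling docx_paragraphs("test.docx") returns one string per paragraph, with empty strings for empty paragraphs.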