Thanks for the pointers to PyMuPDF and PyPDF2.
I know the title on page 3 always uses Arial bold 19,5 pt, but it can stretch to two or three lines
What would be the simplest way to get the string?
Can those packages build a tree that I can navigate à la lxml?
I know the title on page 3 always uses Arial bold 19,5 pt, but it can stretch to two or three lines
What would be the simplest way to get the string?
Can those packages build a tree that I can navigate à la lxml?