Sep-02-2023, 06:42 AM
Hi,
In some pdfs I encounter references to the original parish register, like so: ref = ' RP 477; p. 148 r° '
I perform unidecode on all strings in the document : fieldUni = unidecode.unidecode(field).upper()
This has never caused any problems, except in the above case, when i get this: ' RP 477; P. 148 RDEG '
The " ° " has been "translated" into DEG. That is not what is meant here.
How do I avoid this translation in python (other then a manual ctrl-H replace '°' with ... etc.) in the text document?
thx,
Paul
In some pdfs I encounter references to the original parish register, like so: ref = ' RP 477; p. 148 r° '
I perform unidecode on all strings in the document : fieldUni = unidecode.unidecode(field).upper()
This has never caused any problems, except in the above case, when i get this: ' RP 477; P. 148 RDEG '
The " ° " has been "translated" into DEG. That is not what is meant here.
How do I avoid this translation in python (other then a manual ctrl-H replace '°' with ... etc.) in the text document?
thx,
Paul