Python Forum

Full Version: unicode to ascii for look-like letters
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
does anyone know of a module or database to convert Unicode to ASCII of look-like letters? by that i mean any codes above 127 that have an appearance like or similar to a letter or any character in ASCII gets translated to that letter or character in ASCII that it looks like. for example the letter "A" with a variety of different accent marks or different display effects get translated to "A". if it wants input and/or gives results as numeric codes, that's OK.
Try the unidecode module
>>> from unidecode import unidecode
>>> 
>>> unidecode("L'été à la plage")
"L'ete a la plage"
that's a misleading name. but thinks for the info.