You can use PDFMiner, see: https://www.blog.pythonlibrary.org/2018/...th-python/
or better, but longer learning curve:
You can use NLTK FreqDist package.
you can install nltk with pip, but need to install corpora as well, see:
Installation instructions here: https://www.nltk.org/install.html
You can use (last command is run from shell):
or better, but longer learning curve:
You can use NLTK FreqDist package.
you can install nltk with pip, but need to install corpora as well, see:
Installation instructions here: https://www.nltk.org/install.html
You can use (last command is run from shell):
pip install nltk pip install numpy python -m nltk.downloader all