May-18-2020, 04:26 AM
(This post was last modified: May-18-2020, 05:20 AM by CopBlaster.)
I installed spaCy, but from what I have read I also need an English model. spaCy's installation directions say to run "python -m spacy download en_core_web_md". Since this is my first Python project, I have no clue what that means. I tried typing it into my .py file and running it in Visual Studio, only to get a syntax error.
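For context, "python -m spacy download en_core_web_md" is a command for a terminal (Command Prompt or PowerShell), not Python source code, which would explain the syntax error. If running it from a .py file is preferred, a minimal sketch is to launch the same command through subprocess. The function names here are made up for illustration, and it assumes spaCy is installed for the interpreter that runs the script:

```python
import subprocess
import sys


def build_download_command(model_name):
    """Return the command line that asks spaCy to download a model."""
    # sys.executable is the interpreter running this script, so the model
    # is installed into the same environment as spaCy itself.
    return [sys.executable, "-m", "spacy", "download", model_name]


def download_model(model_name):
    """Run the download; needs network access and an installed spaCy."""
    subprocess.run(build_download_command(model_name), check=True)


if __name__ == "__main__":
    # Only prints the command; call download_model("en_core_web_md")
    # to actually run it.
    print(" ".join(build_download_command("en_core_web_md")))
```

Calling download_model("en_core_web_md") should have the same effect as typing the command in a terminal.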
According to spaCy, that should not be my only option: "Models can be installed from a download URL or a local directory, manually or via pip." Unfortunately, they say it can be done with pip but offer no tips on how.
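As a rough sketch of what "via pip" can look like: pip accepts either an archive URL or a local directory as the thing to install. The URL below is an assumption that follows the naming pattern of the spacy-models releases on GitHub, the local path is hypothetical, and the helper name is made up for illustration:

```python
import subprocess
import sys

# Assumed URL, following the release naming pattern on GitHub; check it
# against the actual release page before relying on it.
MODEL_URL = (
    "https://github.com/explosion/spacy-models/releases/download/"
    "en_core_web_md-2.2.5/en_core_web_md-2.2.5.tar.gz"
)

# Hypothetical location of the unzipped model folder (the one that
# contains setup.py).
MODEL_DIR = "en_core_web_md-2.2.5"


def build_pip_install_command(source):
    """Return a pip command that installs from a URL or a local directory."""
    return [sys.executable, "-m", "pip", "install", source]


def install_model(source):
    """Run pip; needs network access when source is a URL."""
    subprocess.run(build_pip_install_command(source), check=True)


if __name__ == "__main__":
    # Only prints the commands; call install_model(...) to run one of them.
    print(" ".join(build_pip_install_command(MODEL_URL)))
    print(" ".join(build_pip_install_command(MODEL_DIR)))
```

Either printed command can also be typed directly into a terminal. Installing this way lets pip handle setup.py, so it does not have to be run by hand.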
Well, I finally managed to find the manual download link on GitHub. The problem was that the spaCy website links to it as "Release Details" instead of "download" or similar language.
I unzipped the files and copied them to my project in Visual Studio, but when I try to run setup.py I get an error that says:
Severity: Error
Description: file has both Unicode marker and PEP-263 file encoding. You must use "utf-8" as the encoding name when a BOM is present.
Project: PDFParser
File: C:\Users\suername\source\repos\PDFParser\PDFParser\en_core_web_md-2.2.5\setup.py
Line: 2
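The message suggests that setup.py starts with a byte order mark (BOM) while also declaring its encoding in a comment near the top. A minimal sketch for checking that and removing the BOM follows. The helper names are made up for illustration, and whether this clears the Visual Studio error is an assumption, not something confirmed:

```python
import codecs
from pathlib import Path


def has_utf8_bom(path):
    """Return True if the file starts with the UTF-8 byte order mark."""
    with open(path, "rb") as handle:
        return handle.read(len(codecs.BOM_UTF8)) == codecs.BOM_UTF8


def strip_utf8_bom(path):
    """Rewrite the file without its leading UTF-8 BOM, if it has one.

    Returns True if a BOM was removed, False if the file was left alone.
    """
    data = Path(path).read_bytes()
    if data.startswith(codecs.BOM_UTF8):
        Path(path).write_bytes(data[len(codecs.BOM_UTF8):])
        return True
    return False
```

For example, has_utf8_bom("setup.py") reports whether the marker is there, and strip_utf8_bom("setup.py") removes it in place, so keeping a backup copy first is a good idea.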
My project looks like this:
This is where a screenshot would go if I could upload one, but instead this form wants the URL of an image already uploaded elsewhere.