Python Forum
How can I access the corpus
#1


Hello,

I created a corpus for a number of documents as:

from gensim import models, corpora

corpus = corpora.BleiCorpus('./data/ap/ap.dat', './data/ap/vocab.txt')

Now I want to access the individual documents in this corpus so that I can compare them. How can I do that?

Thanks
#2
There are many tutorials here: https://radimrehurek.com/gensim/tutorial.html
This all appears quite new to me; the PyPI listing I found is dated Nov 11, 2017.
You can probably also load the corpus into NLTK, which is a very mature
and very well documented natural language processing package.
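
As far as I know, a gensim corpus is a streamed object, so the usual way to get at its documents is to iterate over it (`for doc in corpus:`), where each document comes back as a list of (word_id, count) pairs. To show what that looks like without needing the AP data, here is a minimal stdlib-only sketch that parses the same LDA-C (Blei) file format by hand. The helper name `read_blei_corpus` and the toy files are made up for illustration and are not part of gensim.

```python
import os
import tempfile


def read_blei_corpus(dat_path, vocab_path):
    """Parse an LDA-C (Blei) corpus into (documents, id2word).

    Hypothetical helper for illustration; not a gensim function.
    """
    # vocab.txt holds one word per line; the line index is the word id.
    with open(vocab_path, encoding="utf-8") as f:
        id2word = {i: line.strip() for i, line in enumerate(f)}

    documents = []
    with open(dat_path, encoding="utf-8") as f:
        for line in f:
            parts = line.split()
            if not parts:
                continue
            # parts[0] is the number of unique terms in the document;
            # the remaining fields are "word_id:count" pairs.
            doc = [(int(i), float(c)) for i, c in (p.split(":") for p in parts[1:])]
            documents.append(doc)
    return documents, id2word


# Build a tiny made-up corpus so the example runs anywhere.
tmp_dir = tempfile.mkdtemp()
dat_path = os.path.join(tmp_dir, "toy.dat")
vocab_path = os.path.join(tmp_dir, "vocab.txt")
with open(vocab_path, "w", encoding="utf-8") as f:
    f.write("apple\nbanana\ncherry\n")
with open(dat_path, "w", encoding="utf-8") as f:
    f.write("2 0:1 2:3\n1 1:2\n")

documents, id2word = read_blei_corpus(dat_path, vocab_path)
for doc_no, doc in enumerate(documents):
    # Map ids back to words to make each document readable.
    print(doc_no, [(id2word[i], c) for i, c in doc])
```

With the real data you would point the two paths at ./data/ap/ap.dat and ./data/ap/vocab.txt. Once the documents are in this sparse (word_id, count) form, comparing two of them comes down to comparing their id/count pairs.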