Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Corpora catalof for NLTK
#1
Hello,

I've been playing with NLTK (again) today. There's quite a good list of available corpora available here
what I have been looking for is how to get a copy of that list programmatically, without having to
scrape the page.

Does anyone know how to do this?

If not, expect to see a scraper for it soon in snippets.


catalof is like a loaf of bread, only made with cata's
Reply
#2
I am posting under snippets some code that creates a json file from the Corpora website.
same title (with spelling correction)
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Help with simple nltk Chatbot Extra 3 1,841 Jan-02-2022, 07:50 AM
Last Post: bepammoifoge
  Saving a download of stopwords (nltk) Drone4four 1 9,105 Nov-19-2020, 11:50 PM
Last Post: snippsat
  Installing nltk dependency Eshwar 0 1,796 Aug-30-2020, 06:10 PM
Last Post: Eshwar
  Clean Data using NLTK disruptfwd8 0 3,301 May-12-2018, 11:21 PM
Last Post: disruptfwd8
  Text Processing and NLTK (POS tagging) TwelveMoons 2 4,857 Mar-16-2017, 02:53 AM
Last Post: TwelveMoons
  NLTK create corpora pythlang 5 10,085 Oct-26-2016, 07:31 PM
Last Post: Larz60+
  serious n00b.. NLTK in python 2.7 and 3.5 pythlang 24 19,484 Oct-21-2016, 04:15 PM
Last Post: pythlang

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020