Python Forum
Comparing two files and word frequency
#1
Hello,
  I just registered on the forum after talking to some very helpful people on the IRC channel.

Here is what I am trying to accomplish:

1. Pick out keyword frequency between two files: one file has a list of keywords, and the other is output in JSON or CSV containing a URL, title, and summary or snippet for each entry. All of those fields contain some kind of text.

I would like to find all the keywords in the JSON/CSV output that match the keywords in the keyword file.

Any help or suggestions are highly appreciated. I look forward to learning a lot from this forum.

Thanks in advance.
-Nix
#2
You can use NLTK.
See http://www.nltk.org/book/ch01.html
Search the page (verbatim) for 'pair of words and compare their usage in two different texts'
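
If NLTK is more than you need, here is a minimal standard-library sketch of the keyword count described in the first post. It is only an illustration under a few assumptions: the keyword file has one single-word keyword per line, the CSV has columns named url, title and summary, and matching is case-insensitive on whole words. The function names and the sample data are made up for the example.

```python
import csv
import io
import re
from collections import Counter


def load_keywords(lines):
    """Read one keyword per line, lowercased; blank lines are skipped."""
    return {line.strip().lower() for line in lines if line.strip()}


def keyword_frequency(keywords, rows, fields=("url", "title", "summary")):
    """Count whole-word, case-insensitive keyword hits across the given fields."""
    counts = Counter()
    for row in rows:
        # Join the text fields of one record; missing fields count as empty.
        text = " ".join(str(row.get(f) or "") for f in fields).lower()
        for word in re.findall(r"[a-z0-9']+", text):
            if word in keywords:
                counts[word] += 1
    return counts


# Hypothetical sample data standing in for the two real files.
keyword_file = io.StringIO("python\nforum\njson\n")
csv_file = io.StringIO(
    "url,title,summary\n"
    "http://example.com/a,Python forum,Ask Python questions here\n"
    "http://example.com/b,JSON basics,Parsing JSON in Python\n"
)

keywords = load_keywords(keyword_file)
counts = keyword_frequency(keywords, csv.DictReader(csv_file))
for word, n in counts.most_common():
    print(word, n)
```

With real files, replace the io.StringIO objects with open(...) handles. For JSON output, json.load should give you a list of dicts that can be passed as rows in the same way, assuming the records use the same field names. Multi-word keywords would need a different matching step, since this sketch only compares single tokens.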
#3
(Feb-02-2017, 05:57 PM)Larz60+ Wrote: You can use nltk
see http://www.nltk.org/book/ch01.html
search on page (verbatim) for 'pair of words and compare their usage in two different texts'

Thanks, I'll check it out.