Python Forum
Linguistic measures on corporate filings
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Linguistic measures on corporate filings
#1
Hi,

I have an excel file which includes txt file name and year. I am trying to conduct textual analysis for corporate disclosure (a bunch of txt files), but having difficulty in generating the following measures. Can anyone provide some guidance? Thank you!!

1. The number of words in sentences that include at least one 4-word phrase that is shared by at least 75% of all firms in a given fiscal year.

2. The number of words in sentences that include at least one 8-word phrase that is identical to a phrase used in the prior year’s 10-K.
Reply
#2
Hi,

I have an excel file which includes txt file name and year. I am trying to conduct textual analysis for corporate disclosure (a bunch of txt files, firm-year level), but having difficulty in generating the following measures. Can anyone provide some guidance? Thank you!!!

1. The number of words in sentences that include at least one 4-word phrase that is shared by at least 75% of all firms in a given fiscal year.

2. The number of words in sentences that include at least one 8-word phrase that is identical to a phrase used in the prior year’s 10-K.
Reply
#3
Is this a homework assignment?
You should show what you have attempted, and where you are having difficulty,.
Reply
#4
Please don't double post.
same post as https://python-forum.io/Thread-Textual-a...-txt-files
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  General linear model with repeated measures Ziv1279 1 2,615 Dec-20-2020, 11:45 PM
Last Post: PsyPy

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020