Python Forum
Random Forest to Identify Page: Feature Selection
#1
Hi,

I am new to machine learning. I know of a project that used a Random Forest to identify the type of each page in financial reports, i.e. to decide whether a page is the Cash Flow Statement or the Income Statement.

The features for the model are:
1) Bag of Words (BOW) computed over all pages in all the financial reports
2) word_check_flow: 1 if the page contains the word "flow"; 0 otherwise
3) word_check_income: 1 if the page contains {“income” & “expense”} or {“revenue”, “sales”, “loss”}; 0 otherwise
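
To make the setup concrete, here is a minimal sketch of how the three features above could be computed for one page. It uses only the standard library and is not the project's actual code: the function names, the tokenizer, and the small vocabulary are hypothetical, and word_check_income is read as "both income and expense, or any of revenue/sales/loss", which is an assumption about the rule in 3).

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the page text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(text):
    """Count how often each token occurs on the page."""
    return Counter(tokenize(text))


def word_check_flow(bow):
    """1 if the exact token "flow" occurs on the page, else 0."""
    return 1 if bow["flow"] > 0 else 0


def word_check_income(bow):
    """Assumed reading of rule 3): ("income" and "expense") or any of
    "revenue", "sales", "loss"."""
    income_and_expense = bow["income"] > 0 and bow["expense"] > 0
    any_sales_term = any(bow[w] > 0 for w in ("revenue", "sales", "loss"))
    return 1 if (income_and_expense or any_sales_term) else 0


def page_features(text, vocabulary):
    """BOW counts in vocabulary order, followed by the two binary flags."""
    bow = bag_of_words(text)
    counts = [bow[word] for word in vocabulary]
    return counts + [word_check_flow(bow), word_check_income(bow)]


# Hypothetical vocabulary; a real one would be built from all pages.
vocabulary = ["cash", "flow", "income", "expense", "revenue"]
page = "Net cash flow from operating activities"
features = page_features(page, vocabulary)
print(features)
```

Both flags are computed from the same word counts that make up the BOW columns, which is what the question below is about. The match is on exact tokens, so a page containing only "expenses" would not set the "expense" condition in this sketch.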

I am puzzled as to why word_check_flow and word_check_income are needed as features when BOW already gives the count of each word on the page.

Thank you