Python Forum
Unbalance Dataset - prediction model
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Unbalance Dataset - prediction model
#1
Hey everyone.

I am working on personal project that can change the face of the restaurant industry.

Let’s make it simple. Dataset of 63k rows, 7 columns. 6 significant characteristics to me target value. 2 two target values ( Show or no show). For instance, I want to build a model that is predicting if a person will show or not show at a restaurant knowing some characteristics ( Type of guest, party size, visits completed, day, hours, month). However, I have 53k rows for reservations that are qualified “Done” against 6k rows for my no show. I built random forest and regression, giving me shit results. Why? How should I deal with that? I have something big, but my model… Any help would be appreciated!

I can forward you beginning of my data set which are encoded such as Day 1 = Lundi Hours 2= between 6 to 7 Month = 3 March Type of Client 3= Member Visits completed 4 Size = 5 meaning 5 people at the table

Attached Files

Thumbnail(s)
   
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Urgent:help about prediction size kzbn 5 828 Apr-29-2024, 06:25 PM
Last Post: deanhystad
  Adapting mnist model with personal dataset GJG 0 960 Oct-09-2022, 02:38 PM
Last Post: GJG
  Can't make Random Forest Prediction work donnertrud 0 1,717 May-23-2020, 12:26 PM
Last Post: donnertrud
  Prediction of Coal Fire Power Plant Pollutants Emission Dalpi 2 2,273 May-08-2020, 06:28 PM
Last Post: Dalpi
  prediction using linear regression (extrapolation?) in a loop karlito 0 3,368 Feb-05-2020, 10:56 AM
Last Post: karlito
  Random Forest high R2 Score but poor prediction donnertrud 5 5,324 Jan-13-2020, 11:23 PM
Last Post: jefsummers

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020