Python Forum

Full Version: Comparing and Identifying ID with Percentage
Hello, everyone

I am new to machine learning and I have a very specific problem at work, but I don't know where to go to get it solved.

I have one database (table 1) which contains information about vehicles, such as Manufacturer, Name, Period of Production, Cubic Capacity, Engine, Horsepower and so on.
I have another database (table 2) which contains all the vehicles we have in our system (ERP), and I need to take the ID from table 2 and map it to the corresponding vehicle in table 1.
The problem is that I can't just VLOOKUP between the tables, because some fields may differ and the row can still be the correct vehicle.

For example:

Table 1:
VOLKSWAGEN, Golf, 1.8, 2015 -> 2019, Engine DBAE, 130HP

Table 2:
ID123, VOLKSWAGEN, Golf, 1.8, 2015 -> 2019, Engine DBFA, 130HP

Although the engine is different, I want to get the ID123.

Another example:

Table 1:
FIAT, Palio, 1.6, 2010 -> 2015, No info available for engine, 99HP

Table 2:
ID456, FIAT, Palio, 1.6, 2009 -> 2015, Engine FireEvo, 97HP

Even though a lot of the information does not match, I still want to get this ID.

If possible, I want to estimate which ID this vehicle may be.
ID456 -> 90%
ID789 -> 8%
ID101 -> 2%

Is it possible to do this? If so, where should I go in order to do this?

I would really appreciate it if you guys could enlighten me.

Greetings from Brazil!
Hi, you can use pandas DataFrames for this. It would have been better if you had attached the files to your question, so that we could try them out and give you a proper solution. But if you are familiar with pandas DataFrames, follow these steps:
From what I can see, the car's name and its ID are related, so you can make the name the index of both tables and join them. Here is a guide to pandas that can help you get started:
Dataframes with Pandas
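
To make the join idea a bit more concrete, here is a rough sketch of one way it could go. It is not a definitive solution, just an illustration under some assumptions: the column names (`make`, `model`, `cc`, `start`, `end`, `engine`, `hp`) are made up, the two Palio and Golf rows are copied from the examples in the question, the attributes of ID789 are invented purely so that there is a second candidate, and the weights in `similarity` are arbitrary and would need tuning on real data. The idea is to join on manufacturer and model only (the fields that seem reliable), score every candidate pair on the remaining fields, and then turn the scores into percentages per vehicle.

```python
import pandas as pd

# Table 1: reference catalog (rows taken from the examples in the question).
catalog = pd.DataFrame([
    {"make": "VOLKSWAGEN", "model": "Golf", "cc": 1.8, "start": 2015, "end": 2019, "engine": "DBAE", "hp": 130},
    {"make": "FIAT", "model": "Palio", "cc": 1.6, "start": 2010, "end": 2015, "engine": None, "hp": 99},
])

# Table 2: ERP vehicles. The ID789 row is invented for illustration only.
erp = pd.DataFrame([
    {"id": "ID123", "make": "VOLKSWAGEN", "model": "Golf", "cc": 1.8, "start": 2015, "end": 2019, "engine": "DBFA", "hp": 130},
    {"id": "ID456", "make": "FIAT", "model": "Palio", "cc": 1.6, "start": 2009, "end": 2015, "engine": "FireEvo", "hp": 97},
    {"id": "ID789", "make": "FIAT", "model": "Palio", "cc": 1.0, "start": 2001, "end": 2008, "engine": "Fire", "hp": 61},
])

# Join on the reliable fields only, so every ERP vehicle with the same
# manufacturer and model becomes a candidate for the catalog row.
left = catalog.reset_index().rename(columns={"index": "catalog_row"})
pairs = left.merge(erp, on=["make", "model"], suffixes=("_cat", "_erp"))


def similarity(row):
    """Score one candidate pair; the weights are arbitrary assumptions."""
    score = 0.0
    # Same cubic capacity is treated as the strongest signal.
    if row["cc_cat"] == row["cc_erp"]:
        score += 3.0
    # Production periods: share of overlapping years in the combined span.
    overlap = min(row["end_cat"], row["end_erp"]) - max(row["start_cat"], row["start_erp"]) + 1
    span = max(row["end_cat"], row["end_erp"]) - min(row["start_cat"], row["start_erp"]) + 1
    score += 2.0 * max(overlap, 0) / span
    # Horsepower: full credit when equal, no credit at a 20% gap or more.
    hp_gap = abs(row["hp_cat"] - row["hp_erp"]) / max(row["hp_cat"], row["hp_erp"])
    score += 2.0 * max(0.0, 1.0 - 5.0 * hp_gap)
    # Engine code: small bonus only when it is known and identical.
    if pd.notna(row["engine_cat"]) and row["engine_cat"] == row["engine_erp"]:
        score += 1.0
    return score


pairs["score"] = pairs.apply(similarity, axis=1)

# Normalise scores to percentages within each catalog vehicle.
# (If every candidate scores 0, the percentage comes out as NaN.)
totals = pairs.groupby("catalog_row")["score"].transform("sum")
pairs["pct"] = 100 * pairs["score"] / totals

# Keep the highest-scoring ERP ID for each catalog vehicle.
best = pairs.loc[pairs.groupby("catalog_row")["pct"].idxmax(), ["catalog_row", "id", "pct"]]
print(pairs[["catalog_row", "id", "score", "pct"]])
print(best)
```

Note that these percentages are only relative scores among the candidates, not real probabilities. If manufacturer or model names can also be spelled differently between the two tables, an exact join like this will miss them and you would need some kind of fuzzy string matching before the merge.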