Jan-17-2020, 10:45 AM
Hi
I am newly learning data science and am wondering if the below will qualify for a project that can be implemented in Python using ML algorithms:
I have master data set that I will have to extract from a pdf. It will have 2 fields e.g. Area Code and Area as below:
AreaCode Area
3100 Gate
3110 Sumps
3230 Fireworks
4222 Air Purifier
4335 Water Filter
I have a second dataset which is created after searching a pdf and extracting data having one field Object Name e.g.
ObjectName
A1-G-3100012
A1-K-3100010
A1-K-3230010
A1-P-3230015
A1-P-4222015
A1-G-4235016
A1-G-4335012
A1-K-3110010
A1-K-3230010
A1-P-3230025
A1-P-4335075
A1-G-4235086
A1-M-3100012
A1-H-3100010
A1-H-3230010
A1-V-3230015
A1-V-4222015
A1-M-4235016
A1-M-4335012
A1-H-3110010
A1-H-3230010
A1-V-3230025
A1-V-4335075
A1-M-4235086
I want to create a model that will learn first dataset and populate AreaCode in second dataset.
Does this make sense for an application of datascience?
Sorry about my ignorance but requesting some inputs.
Regards
I am newly learning data science and am wondering if the below will qualify for a project that can be implemented in Python using ML algorithms:
I have master data set that I will have to extract from a pdf. It will have 2 fields e.g. Area Code and Area as below:
AreaCode Area
3100 Gate
3110 Sumps
3230 Fireworks
4222 Air Purifier
4335 Water Filter
I have a second dataset which is created after searching a pdf and extracting data having one field Object Name e.g.
ObjectName
A1-G-3100012
A1-K-3100010
A1-K-3230010
A1-P-3230015
A1-P-4222015
A1-G-4235016
A1-G-4335012
A1-K-3110010
A1-K-3230010
A1-P-3230025
A1-P-4335075
A1-G-4235086
A1-M-3100012
A1-H-3100010
A1-H-3230010
A1-V-3230015
A1-V-4222015
A1-M-4235016
A1-M-4335012
A1-H-3110010
A1-H-3230010
A1-V-3230025
A1-V-4335075
A1-M-4235086
I want to create a model that will learn first dataset and populate AreaCode in second dataset.
Does this make sense for an application of datascience?
Sorry about my ignorance but requesting some inputs.
Regards