Python Forum

Full Version: Linking Data in Pandas
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Hi All,

I have loaded four .csv files into data frames in pandas, how do I go about linking them? They all have anĀ ID columns, I have an additional level of complexity in that the ID column in one of the files is corrupted. My research says I can find the Levenshtein distance between the ID's, is anyone able to elaborate?

Thanks
Is the same id shared between files? Or are they just row ids?
If file1_id == file2_id == file3_id == file4_id, then... they're already linked?
Duplicate post