Aug-02-2020, 11:29 AM
Hello,
I have a requirement where I have two sets of CSV files(unsorted).
CSV are huge (>10 million records and > 50 columns).
I need to compare these files and highlight differences such as -
1. Rows those are not matching
2. Values those are not matching in those rows.
I have tried filecmp. However, it works on sorted data only.
So could you please suggest any way in Python to efficiently achieve above requirement.
Thanks in Advance
I have a requirement where I have two sets of CSV files(unsorted).
CSV are huge (>10 million records and > 50 columns).
I need to compare these files and highlight differences such as -
1. Rows those are not matching
2. Values those are not matching in those rows.
I have tried filecmp. However, it works on sorted data only.
So could you please suggest any way in Python to efficiently achieve above requirement.
Thanks in Advance