Jul-14-2017, 08:15 PM
(This post was last modified: Jul-14-2017, 08:16 PM by python_enthusiast.)
Suppose we have the following table:
I am trying to do a duplicate check on only Name and Date column and a check to make sure that Title column is "not" duplicate. Salary column does not matter here.
Name Title Date Salary
Joan Analyst 25 51
Joan Support 25 49
Martha CEO 46 44
Martha CEO 46 10
If i use dedupe=df.drop_duplicates, that would give me only record with Joan. What i need instead is to drop Joan and keep Martha's records.
Any assistance would be much appreciated. Thanks
I am trying to do a duplicate check on only Name and Date column and a check to make sure that Title column is "not" duplicate. Salary column does not matter here.
Name Title Date Salary
Joan Analyst 25 51
Joan Support 25 49
Martha CEO 46 44
Martha CEO 46 10
If i use dedupe=df.drop_duplicates, that would give me only record with Joan. What i need instead is to drop Joan and keep Martha's records.
Any assistance would be much appreciated. Thanks