Python Forum

Full Version: Pandas dataframe merge
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
my two datasets are

patient_info:

patient_id	year_born	sex
a7G2n	  1985	       m
h9GH5	  1981	       f
b75GH	  1991	       f
AGK67	  1995	       f
gh8j9	  2000	       m
K78DA	  1600	       m
gh8j9	  2000	       m


patient_results:

id	   erythrocytes	ESR
a7G2n	      4.5	      7
h9GH5	      5.1	
b75GH	      5.3	      3
AGK67		    2
gh8j9	      6.1	     10
K78DA	      5.5	      7

my code is :
import pandas as pd
patient_info = pd.read_csv('patient_info.csv')
patient_results = pd.read_csv('patient_results.csv')
patient_info = patient_info.drop(5)
patient_info = patient_info.duplicated().values.any()
patient_results = patient_results.fillna(0.0)
all_patient_data =  patient_info.merge(patient_results, left_on='patient_id', right_on='id')

I then receive the following error

Error:

Traceback (most recent call last):
  File "run.py", line 139, in <module>
    exec(open(userscript_path).read())
  File "<string>", line 7, in <module>
AttributeError: 'numpy.bool_' object has no attribute 'merge'


I am fairly new to python and data frames.

Please advise on where the error may be?
I believe that in line 30 you convert your dataframe into a boolean object. Would start there.