Python Forum
Assigning Column nunique values to another DataFrame column
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Assigning Column nunique values to another DataFrame column
#1
I'm having issues assigning an nunique value to a dataframe column.

Can't show real data but dataframe df is like below:

Doc_Set 	Doc_Num    Status
	
Set_123	    ED-0001    Complete
Set_123	    EG-0002    Complete
Set_123	    EG-0002    Complete 
Set_456	    EN-0010    In Progress
Set_456	    EN-0010    In Progress
Set_789	    ED-0500    Complete
I need another dataframe 'df2' with unique Doc_Set values and number of unique Doc_Num values for each, where Status = Complete. So df2 should look like:

Doc_Set 	Docs_Unique    
	
Set_123	        2  
Set_789	        1
I'm doing the following:

df2['Doc_Set'] = df[df['Status'] == 'Complete'].Doc_Set.unique()

for d in df2['Doc_Set']:
df2['Docs_Unique'] = df[df['Doc_Set']==d].Doc_Num.nunique()
Somehow every row in df2['Docs_Unique'] ends up as '5'.
Although, if I print(df[df['Doc_Set']==d].Doc_Num.nunique()) in the for loop, I get correct values.

What am I doing wrong?
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Column Transformer with Mixed Types - sklearn aaldb 0 243 Feb-22-2024, 03:27 PM
Last Post: aaldb
  Assigning conditional values in Pandas Scott 3 725 Dec-19-2023, 03:10 AM
Last Post: Larz60+
  concat 3 columns of dataframe to one column flash77 2 776 Oct-03-2023, 09:29 PM
Last Post: flash77
  HTML Decoder pandas dataframe column mbrown009 3 962 Sep-29-2023, 05:56 PM
Last Post: deanhystad
  attempt to split values from within a dataframe column mbrown009 8 2,218 Apr-10-2023, 02:06 AM
Last Post: mbrown009
  Finding the median of a column in a huge CSV file markagregory 5 1,732 Jan-24-2023, 04:22 PM
Last Post: DeaD_EyE
  Make unique id in vectorized way based on text data column with similarity scoring ill8 0 861 Dec-12-2022, 03:22 AM
Last Post: ill8
  Impute 1 if previous row of 'days' column is between 0 & 7 JaneTan 2 1,051 Dec-08-2022, 07:42 PM
Last Post: deanhystad
  Increase df column values decimals SriRajesh 2 1,084 Nov-14-2022, 05:20 PM
Last Post: deanhystad
  pandas column percentile nuncio 7 2,383 Aug-10-2022, 04:41 AM
Last Post: nuncio

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020