Code improvement: groupby and operation on conditionals

**perfringo** · Dec-06-2019, 11:41 AM

I think that main problem is the way how data is structured. I assume that this is in file.

I would:

- read data from file into four - columns structure (compound, location, value, flag)
- use pd.DataFrame to get data needed

import pandas as pd

with open('max_value_compound.txt', 'r') as f:
    data = []
    headers = next(f)
    for row in f:
        location, *rest = row.split()
        for i, compound in zip(range(0, len(rest), 2), range(1, 4)):
            data.append([compound, location, *rest[i:i+2]])

df = pd.DataFrame(data, columns=['compound', 'location', 'value', 'flag'])

df[df['flag'] == 'U'].pivot_table(index=['location', 'compound'], values=['value'], aggfunc=max)   # both giving same output/result
df[df['flag'] == 'U'].groupby(['location', 'compound']).agg({'value': 'max'})

Output:                 value
location compound      
A        2          0.4
         3          0.2
B        1            5
         2          3.2
         3            8
C        2         0.16
D        2         0.30
E        1            2

If location is not needed then one can skip 'location':

df[df['flag'] == 'U'].pivot_table(index=['compound'], values=['value'], aggfunc=max) 
df[df['flag'] == 'U'].groupby(['compound']).agg({'value': 'max'})

which will give:

Output:         value
compound      
1            5
2          3.2
3            8

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	pandas change row value an existing column with conditionals	Gigux	1	2,986	Jun-22-2019, 08:04 PM Last Post: Gigux
	Genetic Algorithm improvement	Alberto	0	4,320	Oct-19-2017, 02:13 PM Last Post: Alberto

Code improvement: groupby and operation on conditionals

User Panel Messages

Announcements