Posts: 14
Threads: 5
Joined: Dec 2020
Hello,
Can somebody please tell me how I can count number of 'Buy', 'Sell' on a yearly basis for the chart below.
thanks,
YK303
Output:
SYMBOL PROCESS Year COMMISSION TRANSACTION_TYPE
@.HSE 170120C00016000 2016-01-12 2016 11.49 BUY
2016-01-13 2016 11.49 BUY
2016-01-26 2016 11.49 BUY
2016-01-28 2016 10.00 SELL
@AAPL 131227C00530000 2013-01-21 2013 7.49 BUY
... ... ... ... ...
NaN 2020-01-15 2020 0.00 INT
2020 0.00 INT
2020-01-16 2020 0.00 REB
2020-01-20 2020 0.00 EFT
2020-01-28 2020 0.00 EFT
Posts: 2,120
Threads: 10
Joined: May 2017
Is this a dataframe from pandas ?
With Python, you can use collections.Counter , but pandas should have a function for this. There is a count and a groupby function, but they don't return the frequency of unique values.
from collections import Counter
import pandas as pd
columns = ("SYMBOL", "PROCESS", "Year", "COMMISSION", "TRANSACTION_TYPE")
data = [
("AB", "C", 2010, "A", "sell"),
("AB", "C", 2010, "C", "buy"),
("AC", "C", 2012, "C", "buy"),
("AD", "C", 2010, "A", "sell"),
]
df = pd.DataFrame(data, columns=columns)
frequency = Counter(df["SYMBOL"])
for value, frequency in frequency.most_common():
print(f"Value: {value}, Freq: {frequency}") Output: Value: AB, Freq: 2
Value: AC, Freq: 1
Value: AD, Freq: 1
What you should read:
And maybe someone else has a solution with pandas-functions for this task.
Posts: 14
Threads: 5
Joined: Dec 2020
Jan-15-2021, 03:27 AM
(This post was last modified: Jan-15-2021, 03:27 AM by yk303.)
Hi DeaD_EyE,
I am indeed using Pandas and DataFrame.
I don't think this is what I am looking for. Is there a way I can attach an image on this post. I don't think I can.
I just want to tally up buys, sells, etc for year.
so the new table should look like this
Year Buy Sell
2012 20 15
2013 25 20
2014 22 8
2015 35 30
2016 24 22
Thanks,
YK303
(Jan-14-2021, 09:34 AM)DeaD_EyE Wrote: Is this a dataframe from pandas ?
With Python, you can use collections.Counter , but pandas should have a function for this. There is a count and a groupby function, but they don't return the frequency of unique values.
from collections import Counter
import pandas as pd
columns = ("SYMBOL", "PROCESS", "Year", "COMMISSION", "TRANSACTION_TYPE")
data = [
("AB", "C", 2010, "A", "sell"),
("AB", "C", 2010, "C", "buy"),
("AC", "C", 2012, "C", "buy"),
("AD", "C", 2010, "A", "sell"),
]
df = pd.DataFrame(data, columns=columns)
frequency = Counter(df["SYMBOL"])
for value, frequency in frequency.most_common():
print(f"Value: {value}, Freq: {frequency}") Output: Value: AB, Freq: 2
Value: AC, Freq: 1
Value: AD, Freq: 1
What you should read:
And maybe someone else has a solution with pandas-functions for this task.
Posts: 2,120
Threads: 10
Joined: May 2017
I'm not a regular user of pandas, so I've read also the documentation like others shuld.
import pandas as pd
columns = ("SYMBOL", "PROCESS", "Year", "COMMISSION", "TRANSACTION_TYPE")
data = [
("AB", "C", 2010, "A", "sell"),
("AB", "C", 2010, "C", "buy"),
("AC", "C", 2012, "C", "buy"),
("AD", "C", 2010, "A", "sell"),
]
df = pd.DataFrame(data, columns=columns)
data = []
for year, block in df.groupby("Year"):
buy = (block["TRANSACTION_TYPE"] == "buy").sum()
sell = (block["TRANSACTION_TYPE"] == "sell").sum()
data.append({"Year": year, "Buy": buy, "Sell": sell})
statistic = pd.DataFrame(data=data, columns=("Year", "Buy", "Sell")) I think this is a little closer to that what you want to solve.
I guess there is still a better way to do it.
|