Python Forum

Full Version: A function to return only certain columns with certain string
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Hello,

I have this data set with 11 columns:

print(US_data)
Output:
Real_Personal_Income ... Retail_Sales_ExFood_YY Date ... 2000-01-31 10,856.0 ... - 2000-02-29 10,900.0 ... - 2000-03-31 10,929.0 ... - 2000-04-30 10,984.0 ... - 2000-05-31 11,022.0 ... - ... ... ... ... 2020-02-29 17,256.0 ... 4.4 2020-03-31 16,916.0 ... -2.6 2020-04-30 18,829.0 ... -15.3 2020-05-31 18,017.0 ... -1.1 2020-06-30 - ... 5.0
I'm trying to find/work out a function that can be used so that only the columns with 'YY' are returned. I assume it would be a if statement but I haven't been able to work it out.

Any help is appreciated.
if US_data is a Pandas dataframe, you can filter columns by name, e.g.
cols_wanted = [col for col in US_data.columns if col.endswith('YY')]. Finally, you can select these columns, e.g. US_data[cols_wanted]. Probably, if 'YY' in col will be more suitable condition.
(Jul-24-2020, 02:23 AM)scidam Wrote: [ -> ]if US_data is a Pandas dataframe, you can filter columns by name, e.g.
cols_wanted = [col for col in US_data.columns if col.endswith('YY')]. Finally, you can select these columns, e.g. US_data[cols_wanted]. Probably, if 'YY' in col will be more suitable condition.

Thanks!