Hello Guys
I would like to ask a general theoretical question. We have right now a BI solution which uses excel files from sharepoint and the data is transformed in powerquery according to the needs of the business groups. In the near future we will have Microsoft Azure implementation, so the data will be available from there. I want to create a BI solution where you access the data on Azure and you create datasets for the various business groups using python data transformations, and after this you analize it in PowerBI. I think the main advantages of this would be:
Better performance: python goes through data way faster than PowerQuery.
Reusability: the created dataset could be reused whenever it is required.
What do you guys think is it a good direction to develop? Also I am having a couple of open questions:
Should I run the scripts inside PowerBI, or outside of it and ingest only the result into PowerBI?
What packages of python should I revise apart from the pandas and numpy, matplotlib ?
Should I use classes with parametres? Or simply create a script for each business group?
Thank you in advance
Mark
I would like to ask a general theoretical question. We have right now a BI solution which uses excel files from sharepoint and the data is transformed in powerquery according to the needs of the business groups. In the near future we will have Microsoft Azure implementation, so the data will be available from there. I want to create a BI solution where you access the data on Azure and you create datasets for the various business groups using python data transformations, and after this you analize it in PowerBI. I think the main advantages of this would be:
Better performance: python goes through data way faster than PowerQuery.
Reusability: the created dataset could be reused whenever it is required.
What do you guys think is it a good direction to develop? Also I am having a couple of open questions:
Should I run the scripts inside PowerBI, or outside of it and ingest only the result into PowerBI?
What packages of python should I revise apart from the pandas and numpy, matplotlib ?
Should I use classes with parametres? Or simply create a script for each business group?
Thank you in advance
Mark