Mar-02-2019, 01:14 AM
My girlfriend has been given the task of getting all the data from a webpage. The web page belongs to the adult education centre where she works. To get to the webpage, you must first log in. The url is a .asp file.
She has to put the data in an Excel sheet. The entries are student names, numbers, ID card number, telephone, courses, books etc. There are thousands of entries. HR students alone has 70 pages of entries. This all shows up on the webpage as a table. It is possible to copy and paste.
I can handle Python openpyxl reasonably well these days and I have heard of web-scraping, which I believe Python can do.
I don't know what .asp is.
Could you please give me some tips, pointers, about how to get the data with Python? What should I look at or learn?
Can I automate this task?
She has to put the data in an Excel sheet. The entries are student names, numbers, ID card number, telephone, courses, books etc. There are thousands of entries. HR students alone has 70 pages of entries. This all shows up on the webpage as a table. It is possible to copy and paste.
I can handle Python openpyxl reasonably well these days and I have heard of web-scraping, which I believe Python can do.
I don't know what .asp is.
Could you please give me some tips, pointers, about how to get the data with Python? What should I look at or learn?
Can I automate this task?