Python Forum
Scrap data from not standarized page?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Scrap data from not standarized page?
#5
Thank you very much for your input! :)

But in this case it wouldn't rather work. I was wondering about finding header of the table ie. "SUMMARY COMPENSATION TABLE" and then get table below.

If i would like to follow approach you have proposed then i would always need some1's name from the table, but they will always be different(and tables can have different named columns). The only common point in those files is this header i think.

The goal is to get automatically this table from X amount of files like this, so i think another approach would be needed (if it's even possible)
Reply


Messages In This Thread
Scrap data from not standarized page? - by zarize - Nov-20-2019, 02:27 PM
RE: Scrap data from not standarized page? - by zarize - Nov-25-2019, 10:25 AM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Web scrap --Need help Lizardpython 4 1,018 Oct-01-2023, 11:37 AM
Last Post: Lizardpython
  trying to save data automatically from this page thunderspeed 1 2,008 Sep-19-2021, 04:57 AM
Last Post: ndc85430
  Scraping a page with log in data (security, proxies) iamaghost 0 2,144 Mar-27-2021, 02:56 PM
Last Post: iamaghost
  I tried every way to scrap morningstar financials data without success so far sparkt 2 8,245 Oct-20-2020, 05:43 PM
Last Post: sparkt
  Web scrap multiple pages anilacem_302 3 3,827 Jul-01-2020, 07:50 PM
Last Post: mlieqo
  Need logic on how to scrap 100K URLs goodmind 2 2,617 Jun-29-2020, 09:53 AM
Last Post: goodmind
  use Xpath in Python :: libxml2 for a page-to-page skip-setting apollo 2 3,629 Mar-19-2020, 06:13 PM
Last Post: apollo
  Sending data to php page ebolisa 0 1,907 Mar-18-2020, 05:34 PM
Last Post: ebolisa
  scrape data 1 go to next page scrape data 2 and so on alkaline3 6 5,172 Mar-13-2020, 07:59 PM
Last Post: alkaline3
  Scrap a dynamic span hefaz 0 2,692 Mar-07-2020, 02:56 PM
Last Post: hefaz

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020