Python Forum
Scraping the page without distorting content
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Scraping the page without distorting content
#1
I am faced with the task of scraping a website page in such a way that its offline browser display matches the original page as closely as possible. At the same time, the content should not be changed or distorted (with the exception of setting up links). Thus, JS execution using Selenuim and / or browser add-ons does not help here (content is distorted). Is there a Python library that can help solve this problem? Example: in JS, the CSS address is programmatically calculated and then loaded into inline CSS.
Reply


Messages In This Thread
Scraping the page without distorting content - by oleglpts - Dec-15-2021, 07:27 AM

Possibly Related Threads…
Thread Author Replies Views Last Post
  I am scraping a web page but got an Error Sarmad54 3 1,501 Mar-02-2023, 08:20 PM
Last Post: Sarmad54
  Python Web Scraping can not getting all HTML content yqqwe123 0 1,685 Aug-02-2021, 08:56 AM
Last Post: yqqwe123
  Scraping a page with log in data (security, proxies) iamaghost 0 2,187 Mar-27-2021, 02:56 PM
Last Post: iamaghost
  Scraping .aspx page Larz60+ 21 51,626 Mar-18-2021, 10:16 AM
Last Post: Larz60+
  Scraping Whole Page Source GJG 1 2,187 Jan-13-2021, 03:19 PM
Last Post: GJG
  Web Scraping Inquiry (Extracting content from a table in asubdomain) DustinKlent 3 3,789 Aug-17-2020, 10:10 AM
Last Post: snippsat
  use Xpath in Python :: libxml2 for a page-to-page skip-setting apollo 2 3,679 Mar-19-2020, 06:13 PM
Last Post: apollo
  Scraping next page of LinkedIn jobs RiteshMahto 6 6,555 Dec-09-2019, 09:43 PM
Last Post: Larz60+
  Scraping data from ebay seller page yuvalta 3 6,044 Sep-25-2019, 04:22 AM
Last Post: sandramoraes
  Django Two blocks of dynamic content on one page iFunKtion 5 4,492 Jul-04-2019, 02:31 AM
Last Post: noisefloor

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020