Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
URL DECODING
#2
Have to be careful when downloading HTML to disk,and then try to read it back again so encoding don't get mess up.
Should always use Requests,then get correct encoding back.
Example:
>>> import requests
>>> 
>>> response = requests.get('https://www.contextures.com/xlSampleData01.html')
>>> response.status_code
200
>>> response.encoding
'ISO-8859-1'
To disk:
import requests

response = requests.get('https://www.contextures.com/xlSampleData01.html')
html = response.text
with open('html_raw.html', 'w', encoding='ISO-8859-1') as f_out:
    f_out.write(html)
Read saved data pandas:
[Image: rZ7nNw.jpg]
I could of course also read the url,without saving to disk.
df = pd.read_html('http://www.contextures.com/xlSampleData01.html', header=0)
Reply


Messages In This Thread
URL DECODING - by UnionSystems - Jan-01-2019, 11:04 PM
RE: URL DECODING - by snippsat - Jan-02-2019, 12:56 AM
RE: URL DECODING - by UnionSystems - Jan-02-2019, 01:36 AM
RE: URL DECODING - by snippsat - Jan-02-2019, 01:49 AM
RE: URL DECODING - by UnionSystems - Jan-02-2019, 03:55 AM
RE: URL DECODING - by UnionSystems - Jan-02-2019, 05:28 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Decoding lat/long in file name johnmcd 4 610 Mar-22-2024, 11:51 AM
Last Post: johnmcd
  Enigma Decoding Problem krisarmstrong 4 1,023 Dec-14-2023, 10:42 AM
Last Post: Larz60+
  json decoding error deneme2 10 4,281 Mar-22-2023, 10:44 PM
Last Post: deanhystad
  flask app decoding problem mesbah 0 2,478 Aug-01-2021, 08:32 PM
Last Post: mesbah
  Decoding a serial stream AKGentile1963 7 9,168 Mar-20-2021, 08:07 PM
Last Post: deanhystad
  xml decoding failure(bs4) roughstroke 1 2,384 May-09-2020, 04:37 PM
Last Post: snippsat
  python3 decoding problem but python2 OK mesbah 0 1,889 Nov-30-2019, 04:42 PM
Last Post: mesbah
  utf-8 decoding failed every time i try adnanahsan 21 11,620 Aug-27-2019, 04:25 PM
Last Post: adnanahsan
  hex decoding in Python 3 rdirksen 2 4,788 May-12-2019, 11:49 AM
Last Post: rdirksen
  Decoding log files in binary using an XML file. captainfantastic 1 2,542 Apr-04-2019, 02:24 AM
Last Post: captainfantastic

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020