URL DECODING

Thread Rating:

0 Vote(s) - 0 Average
1
2
3
4
5

Thread Modes

URL DECODING

snippsat

Administrators

Posts: 7,151

Threads: 122

Joined: Sep 2016

Reputation: 499

Jan-02-2019, 12:56 AM (This post was last modified: Jan-02-2019, 12:56 AM by snippsat.)

Have to be careful when downloading HTML to disk,and then try to read it back again so encoding don't get mess up.
Should always use Requests,then get correct encoding back.
Example:

>>> import requests
>>> 
>>> response = requests.get('https://www.contextures.com/xlSampleData01.html')
>>> response.status_code
200
>>> response.encoding
'ISO-8859-1'

To disk:

import requests

response = requests.get('https://www.contextures.com/xlSampleData01.html')
html = response.text
with open('html_raw.html', 'w', encoding='ISO-8859-1') as f_out:
    f_out.write(html)

Read saved data pandas:

I could of course also read the url,without saving to disk.

df = pd.read_html('http://www.contextures.com/xlSampleData01.html', header=0)

Find

Messages In This Thread

URL DECODING - by UnionSystems - Jan-01-2019, 11:04 PM

RE: URL DECODING - by snippsat - Jan-02-2019, 12:56 AM

RE: URL DECODING - by UnionSystems - Jan-02-2019, 01:36 AM

RE: URL DECODING - by snippsat - Jan-02-2019, 01:49 AM

RE: URL DECODING - by UnionSystems - Jan-02-2019, 03:55 AM

RE: URL DECODING - by UnionSystems - Jan-02-2019, 05:28 PM

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Decoding lat/long in file name	johnmcd	4	610	Mar-22-2024, 11:51 AM Last Post: johnmcd
	Enigma Decoding Problem	krisarmstrong	4	1,023	Dec-14-2023, 10:42 AM Last Post: Larz60+
	json decoding error	deneme2	10	4,281	Mar-22-2023, 10:44 PM Last Post: deanhystad
	flask app decoding problem	mesbah	0	2,478	Aug-01-2021, 08:32 PM Last Post: mesbah
	Decoding a serial stream	AKGentile1963	7	9,168	Mar-20-2021, 08:07 PM Last Post: deanhystad
	xml decoding failure(bs4)	roughstroke	1	2,384	May-09-2020, 04:37 PM Last Post: snippsat
	python3 decoding problem but python2 OK	mesbah	0	1,889	Nov-30-2019, 04:42 PM Last Post: mesbah
	utf-8 decoding failed every time i try	adnanahsan	21	11,620	Aug-27-2019, 04:25 PM Last Post: adnanahsan
	hex decoding in Python 3	rdirksen	2	4,788	May-12-2019, 11:49 AM Last Post: rdirksen
	Decoding log files in binary using an XML file.	captainfantastic	1	2,542	Apr-04-2019, 02:24 AM Last Post: captainfantastic

Users browsing this thread: 1 Guest(s)

View a Printable Version

URL DECODING

User Panel Messages

Announcements