Don't decode the regex results; decode the whole source:
reading = connect_to.read().decode('utf-8')
But you should use Requests; then you get the encoding that the source uses, without decoding it yourself.
import requests

url = 'https://instagram.com/p/BL1rrSQDu48'
url_get = requests.get(url)
# print(url_get.text)  # All source
print(url_get.encoding)  # ISO-8859-1
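The first point, decoding the whole source once and only then running the regex, can be sketched without a network call. This is a minimal illustration, not the original script: the `raw` bytes are a made-up stand-in for what `connect_to.read()` would return, and the regex pattern is an assumption for the example.

```python
import re

# Hypothetical stand-in for the bytes returned by connect_to.read()
raw = '<meta property="og:title" content="Caf\u00e9 photo" />'.encode('utf-8')

# Decode the whole source once, so everything after this works on str
source = raw.decode('utf-8')

# Run the regex on the decoded text (illustrative pattern, not from the thread)
match = re.search(r'content="([^"]+)"', source)
title = match.group(1) if match else None
print(title)
```

Because the decode happens once up front, the regex results are already `str` and need no further decoding.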