Python Forum

Full Version: how to read chinese character?
Hi, I'm trying to translate a Chinese page into English,
but the result I get is gibberish.
How do I "convert" it back to readable Chinese?
# importing the modules
import requests
from bs4 import BeautifulSoup
# target url
url = ""
# making requests instance
reqs = requests.get(url)
# using the BeautifulSoup module
soup = BeautifulSoup(reqs.text, 'html.parser')
# displaying the title
print("Title of the website is : ")
for title in soup.find_all('title'):
    print(title.get_text())
Title of the website is : ÎÒÔÚԭʼÉç»áµ±´å³¤×îÐÂÕ½ÚÁбí_ÎÒÔÚԭʼÉç»áµ±´å³¤×îÐÂÕ½ÚĿ¼_²©ÊËÊéÎÝ
Thank you for reading, have a nice day.
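Change to reqs.content.
This means that Bs4 is given the raw bytes and works out the decoding to Unicode itself. With reqs.text, Requests decodes the page first using its own guess at the encoding, and when that guess is wrong the text is already garbled before Bs4 sees it.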
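To see how this kind of gibberish can arise, here is a minimal sketch using only the standard library. It assumes the page is encoded as GBK and was decoded as ISO-8859-1. The thread does not state the page's encoding, but the first four characters of the garbled title are consistent with that assumption. The sample string is chosen for illustration only.

```python
# Assumption: the server sends GBK bytes (a common legacy encoding for
# Chinese sites). The thread itself does not confirm the encoding.
original = "我在"

raw_bytes = original.encode("gbk")       # the bytes that travel over the wire
wrong = raw_bytes.decode("iso-8859-1")   # decoded with the wrong codec
print(wrong)                             # ÎÒÔÚ

# Reversing the wrong step recovers the text, because ISO-8859-1 maps
# every byte to exactly one character and loses nothing.
repaired = wrong.encode("iso-8859-1").decode("gbk")
print(repaired)                          # 我在
```

Handing the raw bytes to the parser avoids the wrong decoding step in the first place, so no repair is needed afterwards.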
Quote: Any HTML or XML document is written in a specific encoding like ASCII or UTF-8.
But when you load that document into Beautiful Soup, you’ll discover it’s been converted to Unicode.
Unicode, Dammit guesses correctly most of the time.
# importing the modules
import requests
from bs4 import BeautifulSoup

# target url
url = ""

# making requests instance
reqs = requests.get(url)

# using the BeautifulSoup module
soup = BeautifulSoup(reqs.content, 'html.parser')

# displaying the title
print("Title of the website is : ")
for title in soup.find_all('title'):
    print(title.get_text())
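One reason a parser that receives bytes can do better is that it can look inside the document for a declared charset before decoding. The following is only a rough, standard-library sketch of that single idea, not Beautiful Soup's actual implementation. The helper name sniff_declared_charset, the 2048-byte window, and the sample page are all hypothetical.

```python
import re


def sniff_declared_charset(raw: bytes, default: str = "utf-8") -> str:
    """Hypothetical helper: return the charset named near the top of the
    document (for example in a <meta> tag), or `default` if none is found."""
    head = raw[:2048]  # declarations normally appear early in the page
    match = re.search(rb"""charset\s*=\s*["']?([\w.-]+)""", head, re.IGNORECASE)
    if match:
        return match.group(1).decode("ascii").lower()
    return default


# Made-up sample page, encoded as GBK for the purpose of the demonstration.
page = '<html><head><meta charset="gbk"><title>我在</title></head></html>'.encode("gbk")

encoding = sniff_declared_charset(page)
text = page.decode(encoding, errors="replace")
print(encoding)  # gbk
print(text)
```

A real detector also has to handle missing or incorrect declarations, which is where statistical guessing of the kind mentioned in the quote comes in.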
(Aug-25-2022, 07:47 PM) snippsat wrote: Change to reqs.content.

Thank you, I had been looking for this for hours.
I gave you a reputation point.