[SOLVED] How to replace characters in a string?

Winfried · (This post was last modified: Sep-04-2024, 01:33 PM by Winfried.)

Hello,

I can't get Python to replace a string in a string: If I read the HTML file as binary, it fails; If I open it as text, it fails too :-/

#UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 14167: character maps to <undefined>
#with open(INPUTFILE, 'r') as f:

with open(INPUTFILE, 'rb') as f:
	content_text = f.read()

#TypeError: a bytes-like object is required, not 'str'
#content_text.replace("<i>","[i]")
content_text = str(content_text)
content_text.replace("<i>","[i]")

#for some reason, no matter the input (string or bytes), BS will just output first line
content_text = content_text.encode(encoding='UTF-8')
soup = BeautifulSoup(content_text, 'xml')
#<?xml version="1.0" encoding="utf-8"?>, and then stops
print(soup.prettify())

What's the right way to 1) replace a string in a string, and then 2) have Beautiful Soup parse the input successfully?

Thank you.

---
Edit: Since it's no possible to delete a thread, I'll just add the answer… which was simple enough: Read the file as text telling Python which encoding to use (Windows=Latin1 by default), which BeautifulSoup reads fine (doesn't need to be bytes)

with open(INPUTFILE, 'r',encoding='utf-8') as f:
	content_text = f.read()

content_text.replace("<i>","[i]")

soup = BS(content_text, 'xml')
print(soup.prettify())

**Gribouillis** · Sep-04-2024, 01:37 PM

Strings are immutable. The replace() method creates a new string and leaves the original string unchanged.

>>> s = "spam spam <i> spam"
>>> t = s.replace("<i>", "[i]")
>>> t
'spam spam [i] spam'
>>> s
'spam spam <i> spam'
>>>

Winfried · Sep-04-2024, 01:41 PM

Good to know, thanks

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	[SOLVED] Open file, and insert space in string?	Winfried	7	613	May-28-2025, 07:56 AM Last Post: Winfried
	[SOLVED] [Beautiful Soup] Replace tag.string from another file?	Winfried	2	654	May-01-2025, 03:43 PM Last Post: Winfried
	[SOLVED] Special characters in XML	ForeverNoob	3	2,108	Dec-04-2024, 01:26 PM Last Post: ForeverNoob
	[SOLVED] Sub string not found in string ?	jehoshua	4	1,619	Dec-03-2024, 09:17 PM Last Post: jehoshua
	Need to replace a string with a file (HTML file)	tester_V	1	2,076	Aug-30-2023, 03:42 AM Last Post: Larz60+
	doing string split with 2 or more split characters	Skaperen	22	7,003	Aug-13-2023, 01:57 AM Last Post: Skaperen
	How do I check if the first X characters of a string are numbers?	FirstBornAlbratross	6	3,434	Apr-12-2023, 10:39 AM Last Post: jefsummers
	Replace string in a nested Dictianory.	SpongeB0B	2	2,594	Mar-24-2023, 05:09 PM Last Post: SpongeB0B
	Replace with upper(string)	WJSwan	7	3,096	Feb-10-2023, 10:28 AM Last Post: WJSwan
	[SOLVED] [BeautifulSoup] Why does it turn inserted string's brackets into </>?	Winfried	0	2,830	Sep-03-2022, 11:21 PM Last Post: Winfried

[SOLVED] How to replace characters in a string?

User Panel Messages

Announcements