Strange Characters in JSON returned string

fioranosnake · Nov-22-2019, 01:12 PM

Hi,

I'm getting some strange characters at the beginning of an string that I am setting up for a request payload.

In this example I am trying to read a text file containing one data item and then enumerate it.

with open('list.txt', 'r') as f:
             d = {k: v.strip() for (k, v) in enumerate(f, start=1)}
             data = json.dumps(d)
    
             with open('out.txt', 'w') as f_out:  
             f_out.write(data)

the resulting out.txt contains

{"1": "\u00ef\u00bb\u00bf10841911101489"}

the return should read :

{"1": "10841911101489"}

Can anyone help please?

Many thanks,

Fiorano

**Larz60+** · Nov-22-2019, 04:55 PM

hard to say without knowing what's in list.txt, can you provide that?

fioranosnake · Dec-02-2019, 02:10 PM

It seems as though its due to the encoding of my source txt file.

By default it is output from my source app as a UTF-8 file. If I open and save this as an ANSI text file the process works. Is there anything I can do at the start of my python app to amend the encoding?

Thanks for your help

DeaD_EyE · (This post was last modified: Dec-02-2019, 03:56 PM by DeaD_EyE.)

You read a file, which has a UTF8 encoding with Byte Order Mark: EF BB BF

Use as encoding utf-8-sig when you read the file.
Then the BOM is stripped away.

The prefix \uxxxx is just a representation for Unicode code points in Json and also for Python.
What you see, are the first three bytes, which defines the Byte Order. Usually this is not used.
I guess you must seek for documents, which are still using the Byte Order Mark.

fioranosnake · Dec-02-2019, 07:25 PM

(Dec-02-2019, 03:56 PM)DeaD_EyE Wrote: You read a file, which has a UTF8 encoding with Byte Order Mark: EF BB BF

Use as encoding utf-8-sig when you read the file.
Then the BOM is stripped away.

The prefix \uxxxx is just a representation for Unicode code points in Json and also for Python.
What you see, are the first three bytes, which defines the Byte Order. Usually this is not used.
I guess you must seek for documents, which are still using the Byte Order Mark.

Perfect Thank You!!!

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	[SOLVED] How to replace characters in a string?	Winfried	2	1,148	Sep-04-2024, 01:41 PM Last Post: Winfried
	get JSON string from URL with Windows credentials	shwfgd	0	688	Aug-27-2024, 10:08 PM Last Post: shwfgd
	How to access values returned from inquirer	cspower	6	2,852	Dec-26-2023, 09:34 PM Last Post: cspower
	doing string split with 2 or more split characters	Skaperen	22	6,590	Aug-13-2023, 01:57 AM Last Post: Skaperen
	How do I check if the first X characters of a string are numbers?	FirstBornAlbratross	6	3,258	Apr-12-2023, 10:39 AM Last Post: jefsummers
	[split] Parse Nested JSON String in Python	mmm07	4	2,868	Mar-28-2023, 06:07 PM Last Post: snippsat
	string indices must be integers when parsing Json	ilknurg	3	10,355	Mar-10-2022, 11:02 AM Last Post: DeaD_EyE
	SQLAlchemy Object Missing when Null is returned	Personne	1	3,369	Feb-19-2022, 02:50 AM Last Post: Larz60+
	[SOLVED] Delete specific characters from string lines	EnfantNicolas	4	3,378	Oct-21-2021, 11:28 AM Last Post: EnfantNicolas
	Getting "name 'get_weather' is not defined error and no json_data returned?	trthskr4	6	5,390	Sep-14-2021, 09:55 AM Last Post: trthskr4

Strange Characters in JSON returned string

User Panel Messages

Announcements