replace string inside double quotes

jmpatx · (This post was last modified: Apr-21-2017, 09:00 PM by metulburr.)

I am trying to replace commas with a "^" inside fields that are already double-quoted. The input and the desired output are shown below. However, my code does not replace anything.

Any help would be appreciated.

Input:

Id,Category,Description,Date
1,Test,Red Cars,02/12/2017
2,Test,Blue Cars,03/01/2017
3,Test,"Green, big cars",01/05/2016

Output should be:

Id,Category,Description,Date
1,Test,Red Cars,02/12/2017
2,Test,Blue Cars,03/01/2017
3,Test,"Green^ big cars",01/05/2016

import csv

ifile = open('C:/Users/jpilon/Documents/test.csv', 'r')
reader = csv.reader(ifile,delimiter=',')
ofile = open('C:/Users/jpilon/Documents/test_new.csv', 'w')
writer = csv.writer(ofile, delimiter=',')


findlist = ['"*,*"']
replacelist = ['"*^*"']

rep = dict(zip(findlist, replacelist))

def findReplace(find, replace):
   s = ifile.read()
   s = s.replace(find, replace)
   ofile.write(s)

for item in findlist:
   findReplace(item, rep[item])

ifile.close()
ofile.close()

***sparkz_alot*** · Apr-22-2017, 12:45 PM

Could just be me, but your input looks exactly like your output.

***snippsat*** · (This post was last modified: Apr-22-2017, 04:06 PM by snippsat.)

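You never use the reader that the csv module creates.
In line 15 (s = ifile.read()) you read the whole file in as a single string, and str.replace() treats '"*,*"' as literal text rather than a wildcard pattern, so nothing matches.

Instead, read the file into a nested list; then you can replace values and keep the CSV structure.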

import csv

with open('in.csv') as f:
    reader = csv.reader(f, delimiter=',')
    cars_info = [i for i in reader]

Test:

>>> cars_info
[['Id', 'Category', 'Description', 'Date'],
 ['1', 'Test', 'Red Cars', '02/12/2017'],
 ['2', 'Test', 'Blue Cars', '03/01/2017'],
 ['3', 'Test', 'Green, big cars', '01/05/2016']]
>>> cars_info[3][2]
'Green, big cars'

>>> cars_info[3][2] = "Green^ big cars"
>>> cars_info
[['Id', 'Category', 'Description', 'Date'],
 ['1', 'Test', 'Red Cars', '02/12/2017'],
 ['2', 'Test', 'Blue Cars', '03/01/2017'],
 ['3', 'Test', 'Green^ big cars', '01/05/2016']]
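To apply the same change to every field instead of one hand-picked cell, a nested list comprehension is one option. This is only a minimal sketch: the sample rows are copied from the question, and io.StringIO stands in for the real files so the snippet runs on its own (the names source, cleaned, out and result are made up for illustration).

```python
import csv
import io

# Sample rows from the question, held in memory so the sketch is self-contained.
source = io.StringIO(
    'Id,Category,Description,Date\n'
    '1,Test,Red Cars,02/12/2017\n'
    '3,Test,"Green, big cars",01/05/2016\n'
)

# The reader strips the quotes and keeps the embedded comma inside one field.
cars_info = list(csv.reader(source))

# Replace the comma in every field of every row, not just one cell.
cleaned = [[col.replace(',', '^') for col in row] for row in cars_info]

# Write the rows back out in CSV form.
out = io.StringIO()
csv.writer(out, lineterminator='\n').writerows(cleaned)
result = out.getvalue()
print(result)
```

One thing to be aware of: with the writer's default quoting (csv.QUOTE_MINIMAL), a field is only quoted when it still needs it, so 'Green^ big cars' is written back without the surrounding double quotes.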

jmpatx · Apr-24-2017, 03:32 PM

Thanks snippsat. I am new to Python, so forgive my ignorance. I understand what you are doing; where I am lost is how to apply it as a global find and replace in my code.

If I knew that only column 3 could contain the double quotes, how would I replace the commas in any value that met that criterion?

**nilamo** · Apr-24-2017, 03:43 PM

Using the csv module is probably the way to go, since it'll handle the quotes for you. But as usual, a regular expression also works:

>>> text = '''
... Id,Category,Description,Date
... 1,Test,Red Cars,02/12/2017
... 2,Test,Blue Cars,03/01/2017
... 3,Test,"Green, big cars",01/05/2016
... '''
>>> import re
>>> regex = re.compile(r'("[^",]*),([^",]*")')
>>> print(regex.sub(r'\1^\2', text))

Id,Category,Description,Date
1,Test,Red Cars,02/12/2017
2,Test,Blue Cars,03/01/2017
3,Test,"Green^ big cars",01/05/2016
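Note that the pattern above only matches a quoted field containing exactly one comma, because [^",]* on either side of the comma cannot cross a second comma. If a field may hold several commas, one possible variation is to match each whole quoted span and replace inside it with a function. This is a rough sketch, not a full CSV parser: it assumes the double quotes in the text are balanced, and the function name replace_commas_in_quotes is made up for illustration.

```python
import re

# Matches one quoted span: an opening quote, any non-quote characters, a closing quote.
QUOTED = re.compile(r'"[^"]*"')

def replace_commas_in_quotes(text, replacement="^"):
    """Replace every comma that sits between a pair of double quotes."""
    # The lambda receives each quoted span and rewrites only that span,
    # so commas acting as field separators are left alone.
    return QUOTED.sub(lambda m: m.group(0).replace(",", replacement), text)

line = '3,Test,"Green, big, fast cars",01/05/2016'
print(replace_commas_in_quotes(line))
```

Unlike a round trip through csv.reader and csv.writer, this leaves the surrounding double quotes in place, which is what the desired output in the first post shows.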

jmpatx · Apr-24-2017, 10:29 PM

The regular expression seemed to work on my example data, but when I tried a larger file with more columns, it did not replace the comma within the quotes.

Was the regular expression somehow pointing to column 2 only?

import re

with open('file.csv') as f:
    s = f.read() + '\n'  # add trailing new line character

regex = re.compile(r'("[^",]*),([^",]*")')

s1 = (regex.sub(r'\1^\2', s))

print(s1)

f=open('file.csv',"w")
f.write(s1)
f.close()

wavic · Apr-25-2017, 04:16 AM

Just use the csv module:

import csv

with open('/tmp/input.csv', 'r') as in_file:
    data = csv.reader(in_file, delimiter=',')
    for row in data:
        print([col.replace(',', '^') for col in row])

Output:

['Id', 'Category', 'Description', 'Date']
['1', 'Test', 'Red Cars', '02/12/2017']
['2', 'Test', 'Blue Cars', '03/01/2017']
['3', 'Test', 'Green^ big cars', '01/05/2016']

jmpatx · Apr-25-2017, 08:18 PM

I did finally figure out how to read the csv and write the new values to a csv using my test file. Thanks for all your help!

Now when I try this on a 1 GB csv file, I run into a memory error. I know there are ways to do this in chunks, but that should be a question in a new thread.

import csv

new_rows_list = []

# Read File
f1 = open('in_file', 'r')
reader = csv.reader(f1, delimiter=',')
for row in reader:
    new_row = ([col.replace(',', '^') for col in row])
    new_rows_list.append(new_row)


# Write File
f2 = open('out_file', 'w')
writer = csv.writer(f2)
writer.writerows(new_rows_list)
f2.close()
f1.close()

**nilamo** · (This post was last modified: Apr-26-2017, 05:09 AM by nilamo.)

Don't store the whole file in memory, just work on it line-by-line:

import csv

with open("in_file", "r", newline="") as f1:
    reader = csv.reader(f1, delimiter=",")
    with open("out_file", "w", newline="") as f2:
        writer = csv.writer(f2)
        for row in reader:
            new_row = [col.replace(",", "^") for col in row]
            writer.writerow(new_row)

***snippsat*** · Apr-26-2017, 05:26 AM

(Apr-26-2017, 05:08 AM)nilamo Wrote: Don't store the whole file in memory, just work on it line-by-line:

Yep, that's better.
You can also write it like this; a single with statement is enough.

import csv

with open("in.csv", newline="") as f1, open("out.csv", "w", newline="") as f2:
    reader = csv.reader(f1, delimiter=",")
    writer = csv.writer(f2)
    for row in reader:
        new_row = [col.replace(",", "^") for col in row]
        writer.writerow(new_row)
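If only one column should be touched (the earlier question mentioned column 3), the same row-by-row loop can be narrowed to a single index. The following is just a sketch under that assumption: replace_in_column and its parameters are hypothetical names, the column index is 0-based (so column 3 is index 2), and io.StringIO is used in place of real files so it can be run as-is.

```python
import csv
import io

def replace_in_column(src, dst, column, old=",", new="^"):
    """Stream rows from src to dst, changing only one column (0-based index)."""
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    for row in reader:
        # Guard against short or blank rows that lack the target column.
        if column < len(row):
            row[column] = row[column].replace(old, new)
        writer.writerow(row)

src = io.StringIO(
    'Id,Category,Description,Date\n'
    '3,"A, B","Green, big cars",01/05/2016\n'
)
dst = io.StringIO()
replace_in_column(src, dst, column=2)
print(dst.getvalue())
```

With real files, pass the two file objects opened with newline="" as in the code above. Only one row is held in memory at a time, so the file size should not matter. Fields in other columns that still contain a comma (like "A, B" here) are quoted again by the writer.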
