doing string split with 2 or more split characters

Skaperen · (This post was last modified: Aug-04-2023, 02:54 AM by Skaperen.)

if i have the string 'ab|cd!ef|gh!ij|kl!mn' and want to split it into ['ab','cd','ef','gh','ij','kl','mn'], is there a better way than just replacing all the splitter characters with the same one? sometimes i end up with too many .replace() calls. any suggestions for doing this in a nice, pythonic way, in one line?

**deanhystad** · Aug-04-2023, 03:20 AM

re.split()?

Pedroski55 · Aug-04-2023, 06:44 AM

Can't see a one-liner to do this.

# add an unwanted character at the end of the string, or the last wanted group is never printed
mystring = 'ab|cd!ef|gh!ij|kl!mn|'
# define what you want to keep
wanted = 'abcdefghijklmnopqrstuvwxyz'
# index where the current group of wanted characters starts
start = 0
for i, char in enumerate(mystring):
    if char not in wanted:
        # an unwanted character ends the current group
        print(mystring[start:i])
        start = i + 1

menator01 · Aug-04-2023, 09:14 AM

As deanhystad suggested

import re
string = 'ab|cd!ef|gh!ij|kl!mn'
print(re.split(r'[|!]', string))

Output:
['ab', 'cd', 'ef', 'gh', 'ij', 'kl', 'mn']
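
If the separators are not known in advance, the pattern can be built from them instead of typed by hand. This is only a sketch: `split_on_any` is a made-up helper name, not something from `re`, and it assumes every separator should be matched literally, which is why each one goes through `re.escape()`.

```python
import re

def split_on_any(text, separators):
    # Hypothetical helper: escape each separator so characters such as
    # '|' or '.' are matched literally, then join them as alternatives.
    pattern = '|'.join(re.escape(sep) for sep in separators)
    return re.split(pattern, text)

parts = split_on_any('ab|cd!ef|gh!ij|kl!mn', '|!')
print(parts)
```

Passing a list such as `['|', '!', '::']` should also work, since alternation is not limited to single characters the way a character class is.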

***snippsat*** · Aug-04-2023, 01:50 PM

(Aug-04-2023, 06:44 AM)Pedroski55 Wrote: Can't see a one-liner to do this

>>> s = 'ab|cd!ef|gh!ij|kl!mn'
>>> ''.join(c if c not in '|!' else ' ' for c in s).split()
['ab', 'cd', 'ef', 'gh', 'ij', 'kl', 'mn']

With regex you can just use \W (matches any non-word character). Note that this also splits on spaces and any other punctuation, not only on | and !.

>>> import re
>>> 
>>> s = 'ab|cd!ef|gh!ij|kl!mn'
>>> re.split(r'\W', s)
['ab', 'cd', 'ef', 'gh', 'ij', 'kl', 'mn']
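
A couple of things to watch with `\W`, shown here on a made-up string rather than the one from the question: two separators in a row give an empty string in the result, the underscore counts as a word character so it is not split on, and a space is split on just like `|` and `!`. Adding `+` collapses runs of separators, and an explicit character class keeps the split limited to the characters you list.

```python
import re

# Made-up test string: adjacent separators, an underscore, and a space.
s = 'ab|!cd_ef gh'

loose = re.split(r'\W', s)        # empty string between '|' and '!'
collapsed = re.split(r'\W+', s)   # runs of non-word characters act as one separator
explicit = re.split(r'[|!]+', s)  # only '|' and '!' are separators

print(loose)      # ['ab', '', 'cd_ef', 'gh']
print(collapsed)  # ['ab', 'cd_ef', 'gh']
print(explicit)   # ['ab', 'cd_ef gh']
```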

Pedroski55 · Aug-04-2023, 03:27 PM

Oh yes, but then you have imported re to do the work, which is, possibly, somewhat longer than 1 line!

**Gribouillis** · Aug-04-2023, 03:29 PM

(Aug-04-2023, 03:27 PM)Pedroski55 Wrote: but then you have imported re to do the work, which is, possibly, somewhat longer than 1 line!

__import__('re').split(r'\W', s)

**deanhystad** · Aug-04-2023, 03:48 PM

And you used a module. Pedroski55 prefers not using any modules and longs for a way to directly enter the python bytecodes.

***snippsat*** · Aug-04-2023, 04:18 PM

(Aug-04-2023, 03:27 PM)Pedroski55 Wrote: Oh yes, but then you have imported re to do the work, which is, possibly, somewhat longer than 1 line!

Did you not 👀 the first one?

''.join(c if c not in '|!' else ' ' for c in s).split()

Pedroski55 · Aug-05-2023, 05:13 AM

No, sorry, didn't see that! Very good, I like it!

Didn't know you can put so much in ''.join()!!

# original string
s = 'ab|cd!ef|gh!ij|kl!mn'
# add anything you want to keep
wanted = 'abcdefghijklmnopqrstuvwxyz'
# things you don't want
unwanted = {c for c in s if c not in wanted}
# from snippsat I like this
''.join(c if c not in unwanted else ' ' for c in s).split()
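
Another import-free variant of the same idea, sketched with `str.maketrans()` and `str.translate()`: map every separator to a space, then let `.split()` do the rest. The `separators` and `table` names are just for illustration. Like the `''.join()` version, it assumes the pieces themselves contain no whitespace, because `.split()` would break on that too.

```python
s = 'ab|cd!ef|gh!ij|kl!mn'
separators = '|!'

# Translation table that maps each separator character to a space.
table = str.maketrans(separators, ' ' * len(separators))

# Replace the separators in one pass, then split on whitespace.
parts = s.translate(table).split()
print(parts)  # ['ab', 'cd', 'ef', 'gh', 'ij', 'kl', 'mn']
```

As a one-liner this would be `s.translate(str.maketrans('|!', '  ')).split()`.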
