Regular Expression for matching words

xinyulon · (This post was last modified: Mar-09-2022, 09:50 PM by xinyulon.)

Hello. I'm really not sure about the defined re pattern for this exercise, especially how to define the string words between == on both sides in order to remove it as a whole header. Could you please provide me with any tips if you happen to know that. Thank you! Angel

My exercise:

Wikipedia uses two or more equal signs == to mark headers and subheaders in the articles (e.g. ==, ===, ====).

In all cases, the equal signs and the actual header text are separated by spaces on both sides, e.g. == History == or === Further reading ===.

Import the re module and define a regular expression that removes all headers and subheaders from the articles. Store this regular expression under a variable named pattern.

Apply the regular expression under pattern to each article (string object) in the list wiki_articles. Store each processed article into a new list named cleaned_articles.

My answer:

import re
cleaned_articles=[]
for string_object in wiki_articles:
    pattern = re.compile(r'={2,}.+')     #not sure about the defined pattern
    processed = pattern.sub(repl='', string=string_object)
    cleaned_articles.append(processed)

***snippsat*** · Mar-09-2022, 10:34 PM

You most make test strings to see what happens,and no loop before have test this first.

import re

string_object = '''\
== History ==
=== Further reading ===
My car is blue
2 + 2 = 4
++= & hello='''

pattern = re.compile(r'={2,}.+')
processed = pattern.sub(repl='', string=string_object)
print(processed.strip())

Output:My car is blue
2 + 2 = 4
++= & hello=

So your regex should work fine and added .strip() to remove the new line that sub leave.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Regular expression help	anilrajr	4	1,760	May-08-2024, 06:18 PM Last Post: deanhystad
	data validation with specific regular expression	shaheen07	0	865	Jan-12-2024, 07:56 AM Last Post: shaheen07
	Regular Expression search to comment lines of code	Gman2233	5	2,827	Sep-08-2022, 06:57 AM Last Post: ndc85430
	List Creation and Position of Continue Statement In Regular Expression Code	new_coder_231013	3	2,668	Jun-15-2022, 12:00 PM Last Post: new_coder_231013
	Need help with my code (regular expression)	shailc	5	2,931	Apr-04-2022, 07:34 PM Last Post: shailc
	regular expression question	Skaperen	4	3,419	Aug-23-2021, 06:01 PM Last Post: Skaperen
	Generate a string of words for multiple lists of words in txt files in order.	AnicraftPlayz	2	4,027	Aug-11-2021, 03:45 PM Last Post: jamesaarr
	How can I find all combinations with a regular expression?	AlekseyPython	0	2,153	Jun-23-2021, 04:48 PM Last Post: AlekseyPython
	Python Regular expression, small sample works but not on file	Acernz	5	4,318	Jun-09-2021, 08:27 PM Last Post: bowlofred
	Regular expression: cannot find 1st number in a string	Pavel_47	2	3,091	Jan-15-2021, 04:39 PM Last Post: bowlofred

Regular Expression for matching words

User Panel Messages

Announcements