<b> followed by <b> before closing </b>

<b> followed by <b> before closing </b> - WJSwan - Mar-20-2023

I am trying to edit some HTML text in Python. I have an HTML file where there is sometimes a <b> tag (bold), and before it is closed with a </b> there is another <b>, e.g.:

<RF><b>1:4 <b>Hom wat is ... kom:</b>
The second <b> should not be there.

Is it possible to write a regex pattern to find such occurrences and to delete the spurious <b>?

RE: <b> followed by <b> before closing </b> - Axel_Erfurt - Mar-20-2023

Something like this:

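import re

s = "<RF><b>1:4 <b>Hom wat is ... kom:</b>"

def repl(match, count=[0]):
    # The mutable default list persists between calls and acts as a counter.
    # Note: it is never reset, so this works for a single re.sub call only.
    x, = count
    count[0] += 1
    if x > 0:
        return ''   # drop every <b> after the first one
    return '<b>'    # keep the first <b>


print(re.sub('<b>', repl, s))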

Output:
<RF><b>1:4 Hom wat is ... kom:</b>
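
The snippet above keeps only the very first <b> in the whole string, so a file with several bold spans would lose every later opening tag. A possible variant is sketched below. It tracks whether a <b> is currently open and deletes a <b> only while that is the case. The names drop_nested_bold and BOLD_TAG are made up for this illustration, and it assumes plain lowercase <b> and </b> tags without attributes; for anything more irregular, a real HTML parser is probably the safer route.

```python
import re

# Matches either an opening or a closing bold tag (assumed lowercase, no attributes).
BOLD_TAG = re.compile(r"</?b>")


def drop_nested_bold(html):
    """Remove any <b> that appears while an earlier <b> is still open."""
    is_open = False  # True between a kept <b> and the next </b>

    def repl(match):
        nonlocal is_open
        tag = match.group(0)
        if tag == "<b>":
            if is_open:
                return ""  # spurious second <b>: delete it
            is_open = True
            return tag
        # Closing tag: the bold span ends here, so the next <b> is legitimate.
        is_open = False
        return tag

    return BOLD_TAG.sub(repl, html)


s = "<RF><b>1:4 <b>Hom wat is ... kom:</b>"
print(drop_nested_bold(s))
```

Because the state lives inside each call, the function can be run on many strings (or many lines of a file) without the counter carrying over from one call to the next.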