May-13-2019, 05:48 AM
Thank you so much, it's doing the most, however I still get this in my output:
Output:image003.jpgimage/[email protected]:37:13truefalseimage004.jpgimage/[email protected]:37:13truefalseimage005.jpgimage/[email protected]:37:13truefalse396972019-01-28T10:45:35Z2019-01-28T10:45:47ZfalsefalsedigitalSMTPOneOffMarguerite
How do I clean it?(May-08-2019, 11:50 AM)michalmonday Wrote:import re with open('email.txt', 'r') as f: text = f.read() p = re.compile(r'<!--.*-->',re.DOTALL) text = p.sub('', text) p = re.compile(r'^\s*$', re.MULTILINE) text = p.sub('', text) print(text)