Mar-14-2019, 07:49 PM
I have a client who wants to be able to parse and extract the message portions from an mbox (email) file. The mbox I have as an example has huge sections of what appears to be encrypted text. The code below extracts the text portions correctly, I think, but I'm not sure if the code is supposed to handle the seemingly-encrypted mbox text or if the mbox just has encrypted portions that can't be read.
Does the code below look correct to read/extract mbox data: to, from, subject, and body?
Thanks very much in advance,
-O
---
import email.errors
import mailbox

print("Reading emails:")
mbox_file = "/Users/oliver/Desktop/mbox"
print("Processing " + mbox_file)
mbox = mailbox.mbox(mbox_file)

for key in mbox.iterkeys():
    try:
        message = mbox[key]
    except email.errors.MessageParseError:
        # Parse errors are defined in email.errors, not on the mbox object.
        continue  # The message is malformed. Just leave it.

    # A missing header comes back as None, so wrap each one in str().
    print("From: " + str(message['from']))
    print("To: " + str(message['to']))
    print("Subject: " + str(message['Subject']))
    print("-----------------------------")
    print("Body\n")
    # Note: this prints the whole raw message, headers included.
    print(message)
    print("********************************************")
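For comparison, here is a minimal sketch of pulling out only the readable text instead of printing the whole message. It assumes the "encrypted-looking" sections are really base64 or quoted-printable transfer encoding (bodies or attachments), which is only a guess since the sample mbox isn't shown. `print(message)` writes out the raw message, so any encoded parts appear still encoded. The helper name `extract_text_body` and the `demo` function with its sample addresses are made up for illustration. The calls themselves (`walk`, `get_content_type`, `get_payload(decode=True)`, `get_content_charset`) are from the standard `email` package.

```python
import mailbox
import os
import tempfile
from email.message import EmailMessage


def extract_text_body(message):
    """Return the decoded text/plain parts of a message as one string."""
    parts = []
    for part in message.walk():
        # Multipart containers hold other parts and have no body of their own.
        if part.get_content_maintype() == "multipart":
            continue
        # Skip HTML, images, and anything explicitly marked as an attachment.
        if part.get_content_type() != "text/plain":
            continue
        if part.get_content_disposition() == "attachment":
            continue
        # decode=True undoes base64 / quoted-printable and returns bytes.
        payload = part.get_payload(decode=True)
        if payload is None:
            continue
        charset = part.get_content_charset() or "utf-8"
        try:
            parts.append(payload.decode(charset, errors="replace"))
        except LookupError:
            # Unknown charset name in the header; fall back to UTF-8.
            parts.append(payload.decode("utf-8", errors="replace"))
    return "\n".join(parts)


def demo():
    """Write one base64-encoded message to a temporary mbox and read it back."""
    msg = EmailMessage()
    msg["From"] = "alice@example.com"   # hypothetical sample data
    msg["To"] = "bob@example.com"
    msg["Subject"] = "Hello"
    msg.set_content("Hi Bob,\nthis is the body.\n", cte="base64")

    with tempfile.TemporaryDirectory() as tmp:
        box = mailbox.mbox(os.path.join(tmp, "sample.mbox"))
        try:
            box.add(msg)
            box.flush()
            return [
                (m["from"], m["to"], m["subject"], extract_text_body(m))
                for m in box
            ]
        finally:
            box.close()


if __name__ == "__main__":
    for sender, to, subject, body in demo():
        print("From:", sender)
        print("To:", to)
        print("Subject:", subject)
        print("Body:\n" + body)
```

If the decoded output is still unreadable, the message may genuinely be encrypted (PGP or S/MIME, for example), and this sketch would not help with that.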