Python Forum
Extract and rename a file from an Archive
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Extract and rename a file from an Archive
#1
Greetings and happy Holidays!
I’m trying to extract a specific file from archives without the whole directory structure.
Each archive holds the same file name, I have to rename it or it’ll be overwritten.
I thought adding a time stamp to the file name or just replacing the name with the time stamp will be a good idea.
I can pull out the file I need, it works fine, but I’m having a hard time renaming it during the extraction.
Should I rename the file after it is extracted?
Here is the code:
    with zipfile.ZipFile(ef) as zip:
        for items in zip.infolist():
            ff = items.filename
            fn = Path(ff).name  
            #print(f" File Names in the Archive  :: {fn}")                    
            if fn.startswith("Loop") and fn.endswith(".txt") :
                timestm = datetime.now().strftime('%H%M%S%m%d%Y') # ----- Timestamp to add to the file name
                print(f"  {timestm}")
                mm =items.filename = os.path.basename(items.filename) # - Exctacts File only
                print(' FILE TO Copy .....', mm) 
                time.sleep(1)
                try :
                    zip.extract(items, trl_dr_out)                  
                except BadZipfile as bzip:                       
                    print(f" BAD ZIP {bzip}")                            
                    pass 
Any help is appreciated.
Thank you.
alexjordan likes this post
Reply
#2
(Jul-07-2024, 04:14 AM)tester_V Wrote: Should I rename the file after it is extracted?
If you look at the zipfile code in the CPython source tree, extracting calls the _extract_member() method which performs some path manipulation and then calls the 3 following lines to do the work
        with self.open(member, pwd=pwd) as source, \
             open(targetpath, "wb") as target:
            shutil.copyfileobj(source, target)
You could define your own targetpath and write similar lines in your code instead of calling extract().
tester_V likes this post
« We can solve any problem by introducing an extra level of indirection »
Reply
#3
I see. Thank you!
Reply
#4
Just rename using arcname="newname". It seems there is no .remove function in zipfile, so people say copy to a new zip and change the name when you do that. I did find other modules with a .remove function, but apparently that is not implemented in zipfile for some reason.

If the zip file is very big, this may not be the best idea, but for smaller zip files, this is easily done.

Otherwise, use a module that has .remove(), extract the file or files concerned, found with re, remove them from the zip file, rename the extracted files and add them to the zip

from zipfile import ZipFile
import re
from datetime import datetime
from pathlib import Path
import os

path2zip = '/home/pedro/tmp/loopme.zip'
path2newzip = '/home/pedro/tmp/loopme_changed.zip'
file2add = '/home/pedro/tmp/Loop_1.txt'
savepath  = '/home/pedro/tmp/'

# a re to find files with name Loop_something.txt
f = re.compile(r'Loop[^\\/]*\.txt$')

# below is a generator so only runs 1 time
# reload to repeat
images = Path(savepath).glob("*.jpg")

# make a zip
# add some files just for fun + a file called Loop_1.txt
with ZipFile(path2zip, 'w') as zin:
    for image in images:
        savename = image.name
        zin.write(image, arcname=savename)
    savename = file2add.split('/')[-1]
    zin.write(file2add, arcname=savename)

# can't delete from zip files without difficulty
# can't rename the file in the zip easily
# so if the zip is not too big, just copy and exclude
with ZipFile(path2zip, 'r') as zin, ZipFile(pathnewzip, 'a') as zout:    
    for file in zin.filelist:
        print(file.filename)
        if not f.search(file.filename):
            print(file.filename)
            zout.write(savepath + file.filename, arcname=file.filename)
        else:
            newname = datetime.now().strftime('%H%M%S%m%d%Y') + '.txt'
            zout.write(file.filename, arcname=newname)
            
# rename and replace if necessary
os.remove(path2zip)
os.rename(path2newzip, path2zip) # unpacks ok
tester_V likes this post
Reply
#5
That's an interesting suggestion!
Thank you!
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  JSON File - extract only the data in a nested array for CSV file shwfgd 2 1,010 Aug-26-2024, 10:14 PM
Last Post: shwfgd
  rename same file names in different directories elnk 5 2,343 Jul-12-2024, 01:43 PM
Last Post: snippsat
  Extracting specific file from an archive tester_V 4 1,980 Jan-29-2024, 06:41 PM
Last Post: tester_V
  Rename first row in a CSV file James_S 3 1,548 Dec-17-2023, 05:20 AM
Last Post: James_S
  rename file RolanRoll 0 1,060 May-18-2023, 02:17 PM
Last Post: RolanRoll
  '' FTP '' File upload with a specified string and rename midomarc 1 2,171 Apr-17-2023, 03:04 AM
Last Post: bowlofred
  Extract file only (without a directory it is in) from ZIPIP tester_V 1 3,780 Jan-23-2023, 04:56 AM
Last Post: deanhystad
  Installing Pillow from remote archive in Linux Flatpak ChrisOfBristol 6 2,718 Sep-23-2022, 07:48 PM
Last Post: ChrisOfBristol
  How to extract specific data from .SRC (note pad file) Shinny_Shin 2 2,131 Jul-27-2022, 12:31 PM
Last Post: Larz60+
  rename and add desire "_date" to end of file name before extention RolanRoll 1 1,922 Jun-13-2022, 11:16 AM
Last Post: gruntfutuk

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020