Python Forum
Pigz inside python - Reading compressed .gz file much faster
#7
http://aripollak.com/pythongzipbenchmarks/

Looks like the speed depends pretty heavily on which version of Python you're running. You might also gain some improvement by wrapping the gzip object in io.BufferedReader.
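
To make the io.BufferedReader suggestion concrete, here is a minimal sketch. The function name count_lines and the line-counting work are placeholders of mine, not anything from your code. Also note that on recent Python 3 versions gzip already buffers reads internally, so the gain from the extra wrapper may be small there; it is worth timing on your own interpreter.

```python
import gzip
import io


def count_lines(path):
    """Count the lines in a .gz file, reading through io.BufferedReader.

    count_lines is a hypothetical stand-in for whatever per-line work
    the real script does.
    """
    # gzip.open(..., "rb") returns a GzipFile; BufferedReader adds a
    # read buffer in front of it so line iteration makes fewer small reads.
    with gzip.open(path, "rb") as gz, io.BufferedReader(gz) as reader:
        return sum(1 for _ in reader)
```

Call it as count_lines("data.gz"), where "data.gz" is whatever file you are reading, and compare the timing against a plain gzip.open loop.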

I wouldn't mind seeing more of your code, though, as 45 minutes for 1 GB sounds excessive. Depending on what you're doing (and the power of the computer it's running on), maybe we can create a process queue and take advantage of multiple cores/processors.
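
As a rough idea of what the multi-core approach could look like, here is a sketch that assumes the per-line work is independent and CPU-bound. A single gzip stream still has to be decompressed sequentially, so one reader hands batches of lines to a pool of workers. The names process_batch, batches and process_file, and the batch size, are all made up for illustration; process_batch just sums line lengths in place of the real processing.

```python
import gzip
import io
from itertools import islice


def process_batch(lines):
    """Hypothetical per-batch work: total number of bytes in the batch."""
    return sum(len(line) for line in lines)


def batches(reader, size):
    """Yield lists of up to `size` lines until the reader is exhausted."""
    while True:
        batch = list(islice(reader, size))
        if not batch:
            return
        yield batch


def process_file(path, pool, batch_size=10000):
    """Decompress in this process and fan batches out to `pool`.

    `pool` is anything with an imap_unordered method, for example
    multiprocessing.Pool.
    """
    with gzip.open(path, "rb") as gz:
        reader = io.BufferedReader(gz)
        results = pool.imap_unordered(process_batch, batches(reader, batch_size))
        return sum(results)


# Example use (keep it under the __main__ guard when using processes):
# if __name__ == "__main__":
#     from multiprocessing import Pool
#     with Pool(4) as pool:
#         print(process_file("data.gz", pool))
```

This only helps if the processing, not the decompression, is the slow part. If decompression itself is the bottleneck, the workers will mostly sit idle waiting for the reader.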


Messages In This Thread
RE: Pigz inside python - Reading compressed .gz file much faster - by nilamo - Dec-29-2017, 07:34 PM
