Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Hashing big files
#1
I have to hash big files and even with fast algorithms, it takes a lot of time. Regular hdd, not ssd storages. I've tried the usual way but it is slow.
I am thinking to load a file check the first 4k and skip several megabytes then check again another 4k of data adding it to the hash sum.

I am asking how much megabytes is safe to skip in that process? I have to be sure that there are no collisions between the hashes if the files are not the same. Does that approach is going to work?
"As they say in Mexico 'dosvidaniya'. That makes two vidaniyas."
https://freedns.afraid.org
Reply


Messages In This Thread
Hashing big files - by wavic - Mar-25-2018, 12:24 PM
RE: Hashing big files - by Larz60+ - Mar-25-2018, 12:58 PM
RE: Hashing big files - by wavic - Mar-25-2018, 01:27 PM
RE: Hashing big files - by Gribouillis - Mar-25-2018, 02:22 PM
RE: Hashing big files - by wavic - Mar-25-2018, 10:53 PM
RE: Hashing big files - by micseydel - Apr-06-2018, 05:01 PM
RE: Hashing big files - by wavic - Apr-06-2018, 05:48 PM
RE: Hashing big files - by micseydel - Apr-06-2018, 06:51 PM
RE: Hashing big files - by wavic - Apr-06-2018, 07:26 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Hashing tuples remow 7 129,488 Dec-06-2019, 11:04 PM
Last Post: remow

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020