Getting a hash string for a very large file

After reading about large files and memory problems, I suspect that my code below may be inefficient because it reads the entire file into memory before applying the hash algorithm. Is there a better way?

    chunk_size = 1024
    hasher = hashlib.md5()
    while True:
        try:
            data = f.read(chunk_size)
        except IOError, e:
            log.error('error hashing %s on Agent %s' % (path, agent.name))
            return {'error': '%s' % e}
        if not data:
            break
        hasher.update(data)
    hash_string = hasher.hexdigest()
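
For comparison, here is a minimal sketch of the same chunk-at-a-time idea written as a self-contained function. This is not the code under review: the Python 3 syntax, the function name, the one-megabyte chunk size, and the use of iter() with a sentinel are illustrative assumptions.

    import hashlib

    def md5_for_file(path, chunk_size=1024 * 1024):
        # Feed the file to the hash in fixed-size chunks so the whole
        # file never has to sit in memory at once.
        hasher = hashlib.md5()
        with open(path, 'rb') as f:
            # iter() with a b'' sentinel keeps calling f.read(chunk_size)
            # until it returns an empty bytes object at end of file.
            for chunk in iter(lambda: f.read(chunk_size), b''):
                hasher.update(chunk)
        return hasher.hexdigest()

A larger chunk size (for example 1 MiB instead of 1024 bytes) usually reduces the number of read calls while still keeping memory use small.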