Skip to main content
added 60 characters in body
Source Link

I'm trying to create a script that will take a snapshot of the contents of a directory. For each file, all possible metadata will be recorded and written to the database. The point is that with some transfer of this directory, you can take a new snapshot and compare it with the original one taken in the "safe zone" to check whether any third-party changes have been made to the files.

Along with the metadata, I want to capture the hash of each file. I am aware of the existence of a collision problem and the possibility of file substitution so that its hash remains unchanged. Does using multiple hashes increase the complexity of implementing such an attack? Or does it not make sense?

And I am actually interested in this question from a mathematical point of view, if it is possible to consider it using formulas. If this is the case, how can I try to prove the usefulness/uselessness of using multiple hashes for my purpose mathematically?

Edit: Thanks for all the answers, it helped me a lot

I'm trying to create a script that will take a snapshot of the contents of a directory. For each file, all possible metadata will be recorded and written to the database. The point is that with some transfer of this directory, you can take a new snapshot and compare it with the original one taken in the "safe zone" to check whether any third-party changes have been made to the files.

Along with the metadata, I want to capture the hash of each file. I am aware of the existence of a collision problem and the possibility of file substitution so that its hash remains unchanged. Does using multiple hashes increase the complexity of implementing such an attack? Or does it not make sense?

And I am actually interested in this question from a mathematical point of view, if it is possible to consider it using formulas. If this is the case, how can I try to prove the usefulness/uselessness of using multiple hashes for my purpose mathematically?

I'm trying to create a script that will take a snapshot of the contents of a directory. For each file, all possible metadata will be recorded and written to the database. The point is that with some transfer of this directory, you can take a new snapshot and compare it with the original one taken in the "safe zone" to check whether any third-party changes have been made to the files.

Along with the metadata, I want to capture the hash of each file. I am aware of the existence of a collision problem and the possibility of file substitution so that its hash remains unchanged. Does using multiple hashes increase the complexity of implementing such an attack? Or does it not make sense?

And I am actually interested in this question from a mathematical point of view, if it is possible to consider it using formulas. If this is the case, how can I try to prove the usefulness/uselessness of using multiple hashes for my purpose mathematically?

Edit: Thanks for all the answers, it helped me a lot

Tweeted twitter.com/StackCrypto/status/1612283263237627904
Became Hot Network Question
Source Link

Does using multiple hashes (to check if a file has been spoofed) reduce collisions?

I'm trying to create a script that will take a snapshot of the contents of a directory. For each file, all possible metadata will be recorded and written to the database. The point is that with some transfer of this directory, you can take a new snapshot and compare it with the original one taken in the "safe zone" to check whether any third-party changes have been made to the files.

Along with the metadata, I want to capture the hash of each file. I am aware of the existence of a collision problem and the possibility of file substitution so that its hash remains unchanged. Does using multiple hashes increase the complexity of implementing such an attack? Or does it not make sense?

And I am actually interested in this question from a mathematical point of view, if it is possible to consider it using formulas. If this is the case, how can I try to prove the usefulness/uselessness of using multiple hashes for my purpose mathematically?