SimHash, MinHash, and LpHash currently all have implementations of single_hash_collision_probability (MIPSHash and SignALSH are currently missing them, see issue #4). There should also be a function collision_probability(::LSHFunction, sim) that computes the overall probability that the total hash (i.e. the vector of individual hashes) generated by an LSHFunction is equal for two inputs with similarity sim.
Note that if single_hash_collision_probability is already implemented then this is relatively easy to implement using
collision_probability(hashfn::LSHFunction, sim::Real) =
single_hash_collision_probability(hashfn, sim)^nhashes(hashfn)
SimHash,MinHash, andLpHashcurrently all have implementations ofsingle_hash_collision_probability(MIPSHashandSignALSHare currently missing them, see issue #4). There should also be a functioncollision_probability(::LSHFunction, sim)that computes the overall probability that the total hash (i.e. the vector of individual hashes) generated by anLSHFunctionis equal for two inputs with similaritysim.Note that if
single_hash_collision_probabilityis already implemented then this is relatively easy to implement using