algorithm - Hash stable to small changes in text
Is there a hash function that is stable under small changes in text? I'm looking for the opposite of a cryptographic hash, where small changes in the source lead to huge changes in the result.
Something like a perceptual hash for text. Is there such a thing?
Edited: by "small changes in text" I mean changes in punctuation, correction of orthographic / grammatical mistakes, etc. The text is an article, such as a Wikipedia entry (but it can be smaller, 2 or 3 paragraphs).
Bonus points if you can point to a Python implementation.
You're looking for locality-sensitive hashing.
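As a rough illustration of the idea in Python, here is a minimal sketch of SimHash, one well-known locality-sensitive hashing scheme. It is not a reference to any particular library: the names `simhash`, `hamming_distance`, `_tokens` and `_hash64` are hypothetical helpers, and the choices of word-level tokens, a 64-bit fingerprint and BLAKE2b as the per-token hash are assumptions made for the example.

```python
import hashlib
import re

BITS = 64  # fingerprint width (assumption; matches the 8-byte token hash below)


def _tokens(text):
    # Lowercase and keep only runs of word characters,
    # so punctuation and capitalization changes disappear.
    return re.findall(r"\w+", text.lower())


def _hash64(token):
    # Any well-mixed 64-bit hash works; BLAKE2b is used here for stability
    # across runs (the built-in hash() is salted per process).
    digest = hashlib.blake2b(token.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def simhash(text):
    # Each token votes +1 or -1 on every bit position of the fingerprint.
    counts = [0] * BITS
    for tok in _tokens(text):
        h = _hash64(tok)
        for i in range(BITS):
            counts[i] += 1 if (h >> i) & 1 else -1
    # A bit is set in the fingerprint when the positive votes win.
    fingerprint = 0
    for i, c in enumerate(counts):
        if c > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming_distance(a, b):
    # Number of bit positions in which two fingerprints differ.
    return bin(a ^ b).count("1")
```

You would compare two texts with `hamming_distance(simhash(a), simhash(b))`: texts that share most of their words tend to get fingerprints that differ in only a few bits, while unrelated texts tend to differ in roughly half of them. The cutoff for "similar enough" is something you would have to tune on your own data. With this tokenization, edits that only touch punctuation or capitalization give an identical fingerprint. Note that the sketch ignores word order; hashing overlapping word n-grams instead of single words is a common variation if order matters.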