    1. Yes! I think this could be what I am looking for. I did some reading on LSH, and I'm a little lost on how to implement it. The idea seems to be to hash the values in such a way that similar values have a high probability of colliding. Then you can run more precise algorithms only on the values whose hashes collide. Basically, reduce the dimensionality to reduce the computational cost. What I don't understand is which hash function to use: aren't they designed to minimize collisions? Are we talking about hash functions like SHA or MD5?
    2. Good hash functions, such as SHA, are designed to look like random functions, and so they do produce fewer collisions than a very bad hash function would. But the calculations that justify LSH assume perfectly random hash functions as building blocks, so that is fine. Bad hash functions produce collisions by hashing similar inputs to the same output, but LSH doesn't rely on this at all. It just relies on the hash function being consistent, so that if h(ACKOV) = 13 in one place, then h(ACKOV) = 13 in another place. If you find a good practical writeup of LSH, use the hash function in the writeup.
    3. I think that makes sense :) ... Any chance you know of such a writeup, or at least a direction to look in? My attempts so far have been largely fruitless. I've turned up several highly technical resources, but they are more complex than I can handle given my current background.
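
For what it's worth, the idea in the comments above (a consistent, random-looking hash as the building block, then a precise check only on colliding items) can be sketched with MinHash plus banding, which is one common LSH scheme for set similarity. This is only a rough sketch under assumptions, not something taken from the thread or from a particular writeup: the function names (`shingles`, `h`, `minhash_signature`, `candidate_pairs`, `jaccard`), the choice of SHA-1, the 3-character shingles, and the 32 hashes in 16 bands are all made up for illustration.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def shingles(text, k=3):
    """Set of overlapping k-character substrings (k=3 is an arbitrary choice)."""
    text = text.lower()
    return {text[i:i + k] for i in range(max(1, len(text) - k + 1))}


def h(seed, item):
    """Consistent, random-looking hash: the same (seed, item) always maps
    to the same integer. SHA-1 is just one convenient choice here."""
    digest = hashlib.sha1(f"{seed}:{item}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def minhash_signature(items, num_hashes=32):
    """For each seeded hash, keep the minimum hash value over the set.
    Two sets agree at a given position with probability equal to their
    Jaccard similarity (assuming the hash behaves like a random function)."""
    return [min(h(seed, x) for x in items) for seed in range(num_hashes)]


def candidate_pairs(signatures, bands=16):
    """Split each signature into bands; items sharing any whole band land in
    the same bucket and become candidates for a precise comparison."""
    if not signatures:
        return set()
    rows = len(next(iter(signatures.values()))) // bands
    buckets = defaultdict(set)
    for key, sig in signatures.items():
        for b in range(bands):
            band = tuple(sig[b * rows:(b + 1) * rows])
            buckets[(b, band)].add(key)
    pairs = set()
    for keys in buckets.values():
        pairs.update(combinations(sorted(keys), 2))
    return pairs


def jaccard(a, b):
    """Exact similarity, computed only for candidate pairs."""
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    docs = {
        "a": "locality sensitive hashing groups similar items",
        "b": "locality sensitive hashing groups similar item",
        "c": "a completely unrelated sentence about cooking",
    }
    sets = {name: shingles(text) for name, text in docs.items()}
    sigs = {name: minhash_signature(s) for name, s in sets.items()}
    for x, y in sorted(candidate_pairs(sigs)):
        print(x, y, round(jaccard(sets[x], sets[y]), 3))
```

The collisions that matter here come from taking the minimum over each set and from the banding, not from any weakness in the underlying hash, which only needs to be consistent. More bands with fewer rows each makes collisions more likely (more candidates, fewer misses); fewer bands with more rows does the opposite.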