    1. I'm not sure I understood your answer, so sorry if this isn't what you are looking for! So what you're saying is: because fuzzy matching fails transitivity, when I divide up my data based on some relationship between the items, there is no guarantee that the pairs created will encompass all pairs that are in fact similar (that satisfy the relationship); or else all pairs that satisfy it will be present, but with some false positives as well. I prefer accuracy in this case, so if there is a choice, I'd go for the latter. But I probably misunderstood your answer :)
    2. Here's the simplest example illustrating the dilemma. Suppose ~ is "are edit-distance 1 apart". We have the words "goo", "foo", and "food". How should they be de-duplicated? There are no duplicates in the result {"goo", "food"} (and the remainder, "foo", is a duplicate of at least one of these); there are no duplicates in the result {"foo"} (and the remainders, "goo" and "food", are each duplicates of at least one of these); but these two results are significantly different. Which one do you prefer? Can you say carefully why you prefer that one?
    3. Given the words "goo", "foo", and "food", and a "duplicate" threshold of edit distance 1, it seems to me that "goo" and "foo" should match (i.e., be marked as possible duplicates) and "foo" and "food" should as well, but "goo" and "food" should not be marked as duplicates, because they are edit distance 2 apart. But this seems relatively straightforward, so I think I might not be understanding your comment (once again!). Thanks for your patience explaining this, I really appreciate it!
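
The "goo" / "foo" / "food" example from the comments above can be checked with a short sketch. It is only an illustration, not code from the thread: it assumes plain Levenshtein distance as the edit distance, and the helper names (`edit_distance`, `similar_pairs`, `greedy_dedup`) are hypothetical. It shows both views: marking pairs within the threshold, and a greedy de-duplication whose result depends on the order of the input.

```python
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed with the usual two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def similar_pairs(words, threshold=1):
    """All unordered pairs whose edit distance is within the threshold."""
    return [(a, b) for a, b in combinations(words, 2)
            if edit_distance(a, b) <= threshold]


def greedy_dedup(words, threshold=1):
    """Keep a word only if it is not within the threshold of a word already kept."""
    kept = []
    for w in words:
        if all(edit_distance(w, k) > threshold for k in kept):
            kept.append(w)
    return kept


words = ["goo", "foo", "food"]

# Pairwise view: "goo"~"foo" and "foo"~"food", but not "goo"~"food".
pairs = similar_pairs(words)

# De-duplication view: the surviving set depends on processing order.
result_a = greedy_dedup(["goo", "foo", "food"])
result_b = greedy_dedup(["foo", "goo", "food"])

print(pairs)
print(result_a)
print(result_b)
```

The pair list is well defined because it only asks about each pair in isolation. The de-duplicated set is not, because the relation is not transitive: both `result_a` and `result_b` contain no duplicates, yet they differ.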