Note that there are some explanatory texts on larger screens.

plurals
  1. POSimilar String algorithm
    primarykey
    data
    text
    <p>I'm looking for an algorithm, or at least theory of operation on how you would find similar text in two or more different strings...</p> <p>Much like the question posed here: <a href="https://stackoverflow.com/questions/246961/algorithm-to-find-similar-text">Algorithm to find articles with similar text</a>, the difference being that my text strings will only ever be a handful of words.</p> <p>Like say I have a string: "Into the clear blue sky" and I'm doing a compare with the following two strings: "The color is sky blue" and "In the blue clear sky"</p> <p>I'm looking for an algorithm that can be used to match the text in the two, and decide on how close they match. In my case, spelling, and punctuation are going to be important. I don't want them to affect the ability to discover the real text. In the above example, if the color reference is stored as "'sky-blue'", I want it to still be able to match. However, the 3rd string listed should be a BETTER match over the second, etc.</p> <p>I'm sure places like Google probably use something similar with the "Did you mean:" feature...</p> <p><strong>* EDIT *</strong><br> In talking with a friend, he worked with a guy who wrote a paper on this topic. I thought I might share it with everyone reading this, as there are some really good methods and processes described in it...</p> <p>Here's the <a href="http://jeffhuang.com/sigir09_submit.pdf" rel="nofollow noreferrer">link to his paper</a>, I hope it is helpful to those reading this question, and on the topic of similar string algorithms.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload