StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
5304359
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
10
CommunityOwnedDate
2011-03-16T08:48:11.057
CreationDate
2011-03-14T20:51:33.943
FavoriteCount
0
LastActivityDate
2011-03-16T08:47:01.400
LastEditDate
2011-03-16T08:47:01.400
LastEditorUserId
619275
OwnerUserId
619275
ParentId
5299996
PostTypeId
2
Score
11
ViewCount
0
LastEditorDisplayName
text
Body
If you really want to define similarity in the exact way that you have formulated in your question, then you would - as you say - have to implement the Levensthein Distance calculation. Either in code calculated on each row retrieved by a DataReader or as a SQL Server function. The problem stated is actually more tricky than it may appear at first sight, because you cannot assume to know what the mutually shared elements between two strings may be. So in addition to Levensthein Distance you probably also want to specify a minimum number of consecutive characters that actually have to match (in order for sufficient similarity to be concluded). In sum: It sounds like an overly complicated and time consuming/slow approach. Interestingly, in SQL Server 2008 you have the DIFFERENCE function which may be used for something like this. It evaluates the phonetic value of two strings and calculates the difference. I'm unsure if you will get it to work properly for multi-word expressions such as movie titles since it doesn't deal well with spaces or numbers and puts too much emphasis on the beginning of the string, but it is still an interesting predicate to be aware of. <a href="http://msdn.microsoft.com/en-us/library/ms188753.aspx" rel="noreferrer">http://msdn.microsoft.com/en-us/library/ms188753.aspx</a> If what you are actually trying to describe is some sort of search feature, then you should look into the <a href="http://msdn.microsoft.com/en-us/library/ms142571.aspx" rel="noreferrer">Full Text Search</a> capabilities of SQL Server 2008. It provides built-in <a href="http://msdn.microsoft.com/en-us/library/ms142491.aspx" rel="noreferrer">Thesaurus support</a>, fancy SQL <a href="http://msdn.microsoft.com/en-us/library/ms187787.aspx" rel="noreferrer">predicates</a> and a ranking mechanism for "best matches" EDIT: If you are looking to eliminate duplicates maybe you could look into SSIS <a href="http://msdn.microsoft.com/nb-no/library/ms345128.aspx" rel="noreferrer">Fuzzy Lookup and Fuzzy Group Transformation</a>. I have not tried this myself, but it looks like a promising lead. EDIT2: If you don't want to dig into SSIS and still struggle with the performance of the Levensthein Distance algorithm, you could perhaps try this <a href="http://sites.google.com/site/sqlblindman/fuzzysearchalgorithm" rel="noreferrer">algorithm</a> which appears to be less complex.
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POFind sql records containing similar strings
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USSimen S
UserOwnerUserId
1. USSimen S
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POFind sql records containing similar strings
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.