StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
965086
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2009-06-08T14:08:25.777
FavoriteCount
0
LastActivityDate
2009-06-08T14:37:00.727
LastEditDate
2009-06-08T14:37:00.727
LastEditorUserId
115293
OwnerUserId
115293
ParentId
964828
PostTypeId
2
Score
12
ViewCount
0
LastEditorDisplayName
text
Body
My 2 cents. Given that translate.google.com is a statistical machine translation engine, and given "The Unreasonable Effectiveness of Data" by A Halevy, P Norvig (Director of Research at Google) & F Pereira, I make the assumption (bet) that this is a statistically driven spell checker. How it could work: you collect a very large corpus of the language you want to spell check. You store this corpus as phrase-tables in adapted data structures (<a href="http://en.wikipedia.org/wiki/Suffix_array" rel="noreferrer">suffix arrays</a>, for example, if you have to count <a href="http://en.wikipedia.org/wiki/N-gram" rel="noreferrer">n-gram</a> subsets) that keep track of the count (and so an estimated probability) of each n-gram. For example, if your corpus consists only of: <pre><code>I had bean soup last diner. </code></pre> From this entry, you will generate the following bi-grams (sets of 2 words): <pre><code>I had, had bean, bean soup, soup last, last diner </code></pre> and the tri-grams (sets of 3 words): <pre><code>I had bean, had bean soup, bean soup last, soup last diner </code></pre> But they will be pruned by tests of statistical relevance; for example, we can assume that the tri-gram <pre><code>I had bean </code></pre> will disappear from the phrase-table. Now, spell checking only has to look in these big phrase-tables and check the "probabilities". (You need a good infrastructure to store these phrase-tables in an efficient data structure and in RAM. Google has it for translate.google.com, so why not for this? It's easier than statistical machine translation.) Ex: you type <pre><code>I had been soup </code></pre> and in the phrase-table there is a <pre><code>had bean soup </code></pre> tri-gram with a much higher probability than what you just typed! Indeed, you only need to change one word (this is a "not so distant" tri-gram) to have a tri-gram with a much higher probability. 
There should be an evaluation function that handles the distance/probability trade-off. This distance could even be calculated in terms of characters: we are doing spell checking, not machine translation. This is only my hypothetical opinion. ;)
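To make the idea above concrete, here is a minimal Python sketch of the phrase-table lookup. Everything in it is an assumption made for illustration, not a description of what Google does: the helper names (`ngrams`, `edit_distance`, `suggest`), the plain `Counter` standing in for a suffix-array-backed phrase-table, and the scoring rule (count multiplied by a hypothetical `penalty` factor per character edit). The statistical pruning step is left out.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every contiguous run of n words as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def edit_distance(a, b):
    """Character-level Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def suggest(typed, table, penalty=0.5):
    """Pick the tri-gram with the best count-versus-distance trade-off.

    The scoring rule (count * penalty ** distance) is an assumed stand-in
    for the "evaluation function" mentioned in the answer.
    """
    typed_text = " ".join(typed)

    def score(gram):
        return table[gram] * penalty ** edit_distance(typed_text, " ".join(gram))

    return max(table, key=score)


# Toy corpus taken from the answer; a real table would hold a huge corpus.
corpus = "I had bean soup last diner."
tokens = corpus.rstrip(".").split()

bigrams = ngrams(tokens, 2)
trigrams = ngrams(tokens, 3)
table = Counter(trigrams)  # tri-gram -> count (unpruned)

best = suggest(("had", "been", "soup"), table)
print(" ".join(best))
```

With the one-sentence corpus every tri-gram has a count of 1, so the character distance alone decides the winner; with a real corpus the counts would pull the choice toward frequent phrases.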
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POContext Specific Spelling Engine
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USSnippyHolloW
UserOwnerUserId
1. USSnippyHolloW
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POContext Specific Spelling Engine
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTBountyClose
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Rows can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.