<p>This can be viewed as a <a href="http://en.wikipedia.org/wiki/Binary_classification" rel="nofollow">binary (<em>yes</em> or <em>no</em>) classification task</a>. You could write either a rule-based model or a statistics-based model to classify the answers.</p>

<p>A rule-based model would look like <code>if answer in ["never", "not at this time", "nope"] then answer is "no"</code>. When spam filters first came out they contained a lot of rules like these.</p>

<p>A statistics-based model would probably be more suitable here, as writing your own rules gets tiresome and does not handle new cases as well.</p>

<p>For this you need to label a <a href="http://en.wikipedia.org/wiki/Training_set" rel="nofollow">training dataset</a>. After a little preprocessing (such as lowercasing all the words, removing punctuation, and maybe even a little stemming) you could get a dataset like</p>

<pre><code>0 | never in a million years
0 | never
1 | yes sir
1 | yep
1 | yes yes yeah
0 | no way
</code></pre>

<p>Now you can run classification algorithms like Naive Bayes or Logistic Regression over this set and learn which words more often belong to which class. First you vectorize the words: either as binary features (is the word present or not), as word counts (the term frequency), or as tf-idf floats (which prevent bias toward longer answers and common words).</p>

<p>In the above example <code>yes</code> would be strongly correlated with a positive answer (1) and <code>never</code> would be strongly correlated with a negative answer (0). You could work with n-grams, so that <code>not no</code> would be treated as a single token in favor of the positive class. This is called the bag-of-words approach.</p>

<p>To combat spelling errors you can add a spellchecker like Aspell to the preprocessing step. You could use a character vectorizer too, so a word like <code>nno</code> would be interpreted as <code>nn</code> and <code>no</code>, and you would catch errors like <code>hellyes</code>. You could also trust your users to repeat spelling errors: if 5 users make the spelling error <code>neve</code> for the word <code>never</code>, then the token <code>neve</code> will automatically start to count for the negative class (if labeled as such).</p>

<p>You could write these algorithms yourself (Naive Bayes is doable; Paul Graham has written a few accessible essays on how to classify spam with Bayes' Theorem, and nearly every ML library has a tutorial on this), or make use of libraries or programs like scikit-learn (MultinomialNB, SGDClassifier, LinearSVC, etc.) or Vowpal Wabbit (logistic regression, quantile loss, etc.).</p>
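<p>Putting the steps above together, here is a minimal sketch using scikit-learn. It assumes the tiny labeled dataset from the example (a real model would need far more training data) and uses word counts with unigrams and bigrams; the other vectorization variants mentioned above are noted in the comments.</p>

```python
# A minimal sketch of the statistics-based approach described above,
# using scikit-learn on the tiny example dataset. A real classifier
# would need far more labeled training data than this.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

texts = [
    "never in a million years",
    "never",
    "yes sir",
    "yep",
    "yes yes yeah",
    "no way",
]
labels = [0, 0, 1, 1, 1, 0]  # 0 = negative answer, 1 = positive answer

# Bag-of-words vectorization as word counts, with unigrams + bigrams
# so that a phrase like "not no" becomes a single token. Swap in
# CountVectorizer(binary=True) for binary features, or
# TfidfVectorizer for the tf-idf variant; analyzer="char_wb" would
# give the character-vectorizer approach for catching typos.
vectorizer = CountVectorizer(ngram_range=(1, 2))
X = vectorizer.fit_transform(texts)

clf = MultinomialNB()
clf.fit(X, labels)

# Classify two unseen answers.
pred = clf.predict(vectorizer.transform(["yes yes", "never ever"]))
print(pred)  # "yes yes" → 1, "never ever" → 0
```

<p>Because <code>never</code> only ever appears in negatively labeled answers and <code>yes</code> only in positive ones, Naive Bayes assigns the new answers to the expected classes even from six examples.</p>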