**Existing Implementations of Naive Bayes**

You would probably be better off just using one of the existing packages that support document classification using naive Bayes, e.g.:

**Python** - To do this using the Python-based **[Natural Language Toolkit (NLTK)](http://www.nltk.org/)**, see the **[Document Classification](http://nltk.googlecode.com/svn/trunk/doc/book/ch06.html#document-classification)** section in the freely available [NLTK book](http://www.nltk.org/book). (A minimal sketch of this route appears at the end of this answer.)

**Ruby** - If Ruby is more your thing, you can use the **[Classifier](http://classifier.rubyforge.org/)** gem. Here's sample code that detects [whether Family Guy quotes are funny or not funny](http://www.igvita.com/2007/05/23/bayes-classification-in-ruby/).

**Perl** - Perl has the **[Algorithm::NaiveBayes](http://search.cpan.org/dist/Algorithm-NaiveBayes/lib/Algorithm/NaiveBayes.pm)** module, complete with a sample usage snippet in the package [synopsis](http://search.cpan.org/dist/Algorithm-NaiveBayes/lib/Algorithm/NaiveBayes.pm#SYNOPSIS).

**C#** - C# programmers can use **[nBayes](http://nbayes.codeplex.com/)**. The project's home page has sample code for a simple spam/not-spam classifier.

**Java** - Java folks have **[Classifier4J](http://classifier4j.sourceforge.net/)**. You can see a training and scoring code snippet [here](http://classifier4j.sourceforge.net/usage.html#Using_BayesianClassifier).

**Bootstrapping Classification from Keywords**

It sounds like you want to start with a set of keywords that are **known to cue for certain topics** and then use those keywords to [**bootstrap a classifier**](http://en.wikipedia.org/wiki/Bootstrapping_%28machine_learning%29).

This is a reasonably clever idea. Take a look at the paper **[Text Classification by Bootstrapping with Keywords, EM and Shrinkage](http://www.kamalnigam.com/papers/keywordcat-aclws99.pdf)** by McCallum and Nigam (1999). Following this approach, they improved classification accuracy from the 45% achieved by hard-coded keywords alone to 66% with a bootstrapped naive Bayes classifier. For their data, the latter is close to human levels of agreement, as people agreed with each other about document labels 72% of the time.
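To make the NLTK option above concrete, here is a minimal sketch of training and querying `nltk.NaiveBayesClassifier` on bag-of-words features. The tiny corpus and the `finance`/`sports` labels are invented for illustration; real training data would be much larger:

```python
import nltk

def bag_of_words(text):
    # Represent a document as word-presence features, the form
    # nltk.NaiveBayesClassifier expects: {feature_name: value}.
    return {word: True for word in text.lower().split()}

# A made-up toy corpus of (document, label) pairs.
train_docs = [
    ("the quarterly earnings beat market estimates", "finance"),
    ("shares fell sharply after the announcement", "finance"),
    ("the team won the championship game", "sports"),
    ("the striker scored twice in the match", "sports"),
]

classifier = nltk.NaiveBayesClassifier.train(
    [(bag_of_words(text), label) for text, label in train_docs]
)

# Expected: 'finance', since "market" only occurs in finance documents.
print(classifier.classify(bag_of_words("stocks rallied as the market opened")))

# Inspect which word features the model found most predictive.
classifier.show_most_informative_features(5)
```

`show_most_informative_features` is a handy sanity check: it prints the features whose presence most strongly separates the classes.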
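And here is a rough sketch of the bootstrapping idea on top of the same toolkit. This is a simplified single hard-label iteration, not McCallum and Nigam's full EM procedure with soft labels and shrinkage, and the seed keywords and corpus are again invented:

```python
import nltk

# Hand-picked cue words per topic (hypothetical seed set).
SEED_KEYWORDS = {
    "finance": {"market", "shares", "earnings"},
    "sports": {"match", "team", "game"},
}

def bag_of_words(text):
    return {word: True for word in text.lower().split()}

def keyword_label(text):
    # Seed-label a document only if exactly one topic's keywords appear.
    words = set(text.lower().split())
    hits = [topic for topic, kws in SEED_KEYWORDS.items() if words & kws]
    return hits[0] if len(hits) == 1 else None

corpus = [  # unlabeled documents
    "shares slid as earnings disappointed investors",
    "the team clinched the match in extra time",
    "analysts expect the market to rebound",
    "fans packed the stadium for the final",  # no seed keywords hit this one
]

# Step 1: hard-label whatever the keywords can reach.
seeded = [(bag_of_words(doc), lab) for doc in corpus
          if (lab := keyword_label(doc)) is not None]

# Step 2: train an initial naive Bayes model on the seeded subset.
model = nltk.NaiveBayesClassifier.train(seeded)

# Step 3: let the model label every document, including ones the
# keywords missed, then retrain on its own predictions.
relabeled = [(bag_of_words(doc), model.classify(bag_of_words(doc)))
             for doc in corpus]
model = nltk.NaiveBayesClassifier.train(relabeled)

print(model.classify(bag_of_words("the stadium crowd cheered the team")))
```

In practice you would iterate step 3 until the labels stabilize, and, as in the paper, weight documents by the classifier's confidence rather than committing to hard labels each round.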