StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
2181110
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2010-02-02T00:31:50.570
FavoriteCount
0
LastActivityDate
2010-02-02T01:43:33.577
LastEditDate
2010-02-02T01:43:33.577
LastEditorUserId
166686
OwnerUserId
166686
ParentId
2180915
PostTypeId
2
Score
4
ViewCount
0
LastEditorDisplayName
text
Body
No doubt, Google News may use other tricks (or even a combination thereof), but one relatively cheap trick, computationally, to infer topics from free-text would exploit the NLP notion that a word gets its meaning only when connected to other words. An algorithm susceptible of discovering new topic categories from multiple documents could be outlined as follow: <ul> <li>POS (part-of-speech) tag the text We probably want to focus more on nouns and maybe even more so on named entities (such as Obama or New England)</li> <li>Normalize the text In particular replace inflected words by their common stem. Maybe even replace some adjectives by a corresponding Named Entity (ex: Parisian ==> Paris, legal ==> law) Also, remove noise words and noise expressions.</li> <li>identify some words from a list of manually maintained "current / recurring hot words" (Superbowl, Elections, scandal...) This can be used in subsequent steps to provide more weight to some N-grams</li> <li>Enumerate all N-grams found in each documents (where N is 1 to say 4 or 5) Be sure to count, separately, the number of occurrences of each N-gram within a given document and the number of documents which cite a given N-gram</li> <li>The most frequently cited N-grams (i.e. the ones cited in the most documents) are probably the Topics.</li> <li>Identify the existing topics (from a list of known topics)</li> <li>[optionally] Manually review the new topics</li> </ul> This general recipe can also be altered to leverage other attributes of the documents and the text therein. For example the document origin (say cnn/sports vs. cnn/politics ...) can be used to select domain specific lexicons. Another example the process can more or less heavily emphasize the words/expressions from the document title (or other areas of the text with a particular mark-up).
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POblindly classifying new trends in incoming data
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USmjv
UserOwnerUserId
1. USmjv
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.