StackOverflow2013


plurals

PO
primarykey
Id
4419542
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
6
CommunityOwnedDate
CreationDate
2010-12-11T23:24:21.750
FavoriteCount
0
LastActivityDate
2010-12-12T00:00:46.863
LastEditDate
2010-12-12T00:00:46.863
LastEditorUserId
236047
OwnerUserId
236047
ParentId
4419499
PostTypeId
2
Score
24
ViewCount
0
LastEditorDisplayName
text
Body
EDIT: Your one-column indices are not enough. You need an index that covers, at least, the three columns involved.
More advanced solution: replace <code>replycount > 1</code> with <code>hasreplies = 1</code> by creating a new <code>hasreplies</code> field that equals 1 when <code>replycount > 1</code>. Once this is done, create an index on the three columns, in this order: <code>INDEX(forumid, hasreplies, dateline)</code>. Make sure it's a BTREE index to support ordering.
You're selecting based on: <ul> <li>a given <code>forumid</code></li> <li>a given <code>hasreplies</code></li> <li>ordered by <code>dateline</code></li> </ul>
Once you do this, your query execution will involve: <ul> <li>moving down the BTREE to find the subtree that matches <code>forumid = X</code>. This is a logarithmic operation (duration: log(number of forums)).</li> <li>moving further down the BTREE to find the subtree that matches <code>hasreplies = 1</code> (while still matching <code>forumid = X</code>). This is a constant-time operation, because <code>hasreplies</code> is only 0 or 1.</li> <li>moving through the dateline-sorted subtree to get the required results, without having to read and re-sort the entire list of items in the forum.</li> </ul>
My earlier suggestion to index on <code>replycount</code> was incorrect: it would have been a range query, which prevents the index from also being used to sort the results by <code>dateline</code>. You would have selected the threads with replies very fast, but the resulting million-line list would have had to be sorted completely before looking for the 100 elements you needed.
IMPORTANT: while this improves performance in all cases, your huge OFFSET value (10000!) is going to decrease performance, because MySQL does not seem to be able to skip ahead despite reading straight through a BTREE. So, the larger your OFFSET is, the slower the request will become.
I'm afraid the OFFSET problem is not automagically solved by spreading the computation over several computers (how do you skip an offset in parallel, anyway?) or by moving to NoSQL. All solutions (including NoSQL ones) will boil down to simulating OFFSET based on <code>dateline</code>: basically saying <code>dateline > Y LIMIT 100</code> instead of <code>LIMIT Z, 100</code>, where <code>Y</code> is the date of the item at offset <code>Z</code>. This works, and eliminates any performance issues related to the offset, but prevents going directly to page 100 out of 200.
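To make the two ideas above concrete, here is a minimal sketch of the three-column index and of paging by <code>dateline</code> instead of by OFFSET. It is an illustration under assumptions, not the asker's actual schema: it uses Python's built-in sqlite3 module as a stand-in for MySQL, and the table name <code>thread</code>, the <code>threadid</code> column, the index name, the helper function names, and the generated rows are all hypothetical. Only <code>forumid</code>, <code>replycount</code>, <code>hasreplies</code> and <code>dateline</code> come from the answer.

```python
import sqlite3

# SQLite in memory stands in for MySQL here (assumption for a runnable demo).
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE thread (              -- hypothetical table name
        threadid   INTEGER PRIMARY KEY,
        forumid    INTEGER NOT NULL,
        replycount INTEGER NOT NULL,
        hasreplies INTEGER NOT NULL,   -- 1 when replycount > 1, else 0
        dateline   INTEGER NOT NULL
    )
""")
# Composite index in the order given in the answer.
conn.execute(
    "CREATE INDEX idx_forum_replies_date "
    "ON thread (forumid, hasreplies, dateline)"
)

# Made-up rows: two forums, unique increasing dateline values.
rows = []
for i in range(1, 1001):
    replycount = i % 3
    rows.append((i, 1 + i % 2, replycount, 1 if replycount > 1 else 0, 1000 + i))
conn.executemany("INSERT INTO thread VALUES (?, ?, ?, ?, ?)", rows)


def page_by_offset(forumid, offset, size):
    """Classic paging: the engine still walks past `offset` rows."""
    return conn.execute(
        "SELECT threadid, dateline FROM thread "
        "WHERE forumid = ? AND hasreplies = 1 "
        "ORDER BY dateline LIMIT ? OFFSET ?",
        (forumid, size, offset),
    ).fetchall()


def page_after(forumid, last_dateline, size):
    """Simulated OFFSET: start right after the last dateline already shown."""
    return conn.execute(
        "SELECT threadid, dateline FROM thread "
        "WHERE forumid = ? AND hasreplies = 1 AND dateline > ? "
        "ORDER BY dateline LIMIT ?",
        (forumid, last_dateline, size),
    ).fetchall()


first_page = page_by_offset(1, 0, 10)
last_seen = first_page[-1][1]          # dateline of the last row on page 1
second_page = page_after(1, last_seen, 10)
```

With this toy data, <code>page_after(1, last_seen, 10)</code> returns the same rows as <code>page_by_offset(1, 10, 10)</code>. The sketch relies on <code>dateline</code> values being unique; if several threads can share a date, the comparison would need a tie-breaker column. As the answer notes, the caller must know the last <code>dateline</code> shown, so jumping straight to an arbitrary page is not possible this way.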
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POMySQL and NoSQL: Help me to choose the right one
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USVictor Nicollet
UserOwnerUserId
1. USVictor Nicollet
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
