StackOverflow2013


plurals

PO
primarykey
Id
18250129
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
2
CommunityOwnedDate
CreationDate
2013-08-15T09:43:55.200
FavoriteCount
0
LastActivityDate
2013-08-15T10:13:46.510
LastEditDate
2013-08-15T10:13:46.510
LastEditorUserId
567864
OwnerUserId
567864
ParentId
18249987
PostTypeId
2
Score
3
ViewCount
0
LastEditorDisplayName
text
Body
In certain situations, I would expect foo3 to be faster because it can short circuit: at most 4 branches are evaluated, whereas in foo2 all 4 branches are always evaluated. When <code>s</code> is not equal to any of the 4 array elements (as is extremely likely in this case), foo2 and foo3 are basically the same code, and 4 branches happen in both functions. Consider what foo3 really looks like (in terms of branches):
<pre><code>if (p[i + 0] == s)
    sum++;
else if (p[i + 1] == s)
    sum++;
else if (p[i + 2] == s)
    sum++;
else if (p[i + 3] == s)
    sum++;
</code></pre>
This should make it apparent that as long as the <code>if</code>s keep coming up false, the sub-branches are going to be evaluated. So when none of the <code>if</code>s is true, foo3 executes the same number of operations as foo2 (though the two do not have the same functionality). A crude way to think about it is as if each <code>if</code> has a cost (not the body of the <code>if</code>, the actual <code>if</code>). In other words, each time an <code>if</code> is reached in the execution flow, a certain cost is paid, because a branch must be taken or skipped. Thinking about it this way, it's clear that the cost of each function is the same when foo3's flow doesn't short circuit (when all 4 of <code>foo3</code>'s <code>if</code>s are encountered). (As KillianDS noted, if branch prediction is wrong, foo3 will actually take longer, since the wrongly predicted branch has to be rewound and the right one executed instead. In your case, though, it seems that the correct branch is always being predicted.) It's kind of like how the following snippets of code can have the same performance:
<pre><code>if (short_runtime()) {} </code></pre>
And:
<pre><code>if (short_runtime() && long_runtime()) {} </code></pre>
If <code>short_runtime()</code> returns true, the one with the second function call is obviously going to take longer.
If <code>short_runtime()</code> returns false, though, the <code>long_runtime()</code> call will never happen, and thus the run times will be the same (or at least extremely similar).
<hr>
To test this theory, you can arrange for <code>p[i + 0] == s</code> to be true. Just value initialize the array (<code>session* p = new session[SIZE]();</code>), and use <code>session s = {1, 2, 3, 4, 5};</code> locally.
<hr>
There seems to be a bit of confusion about the purpose/result of loop unrolling. It's done so that fewer jumps have to happen. If <code>n</code> things have to be done, then instead of <code>n</code> iterations (jumps) with 1 action per iteration, you can have <code>n/k</code> iterations (jumps) with <code>k</code> actions per iteration. When everything can fit in the cache, this can provide a speed boost (if it can't fit in the cache, it can actually kill performance!). The instructions aren't happening simultaneously (if they were, <code>sum</code> would need a mutex around it, which would be extremely expensive). They're simply happening in sets of 4 instead of sets of 1.
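If you'd rather count than time, here is a rough sketch of the "each <code>if</code> has a cost" idea. Everything in it is an assumption made for illustration: <code>Item</code> is a made-up stand-in for the question's <code>session</code> type, and <code>count_independent</code> / <code>count_short_circuit</code> are only my guess at the general shape of foo2 and foo3, not the original functions. The comparison operator bumps a counter so you can see how many comparisons actually run in each version.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the question's element type.
struct Item {
    int v;
};

// Tally of how many comparisons were actually evaluated.
static std::size_t g_compares = 0;

bool operator==(const Item& a, const Item& b) {
    ++g_compares;
    return a.v == b.v;
}

// foo2-like (assumed shape): four independent tests per unrolled step,
// so all four comparisons always run.
int count_independent(const std::vector<Item>& p, const Item& s) {
    assert(p.size() % 4 == 0);  // this sketch has no remainder loop
    int sum = 0;
    for (std::size_t i = 0; i < p.size(); i += 4) {
        if (p[i + 0] == s) sum++;
        if (p[i + 1] == s) sum++;
        if (p[i + 2] == s) sum++;
        if (p[i + 3] == s) sum++;
    }
    return sum;
}

// foo3-like (assumed shape): an else-if chain, so the remaining
// comparisons in a step are skipped after the first match.
int count_short_circuit(const std::vector<Item>& p, const Item& s) {
    assert(p.size() % 4 == 0);  // this sketch has no remainder loop
    int sum = 0;
    for (std::size_t i = 0; i < p.size(); i += 4) {
        if (p[i + 0] == s) sum++;
        else if (p[i + 1] == s) sum++;
        else if (p[i + 2] == s) sum++;
        else if (p[i + 3] == s) sum++;
    }
    return sum;
}

using Counter = int (*)(const std::vector<Item>&, const Item&);

// Runs one counter and reports how many comparisons it evaluated.
std::size_t compares_for(Counter fn, const std::vector<Item>& p, const Item& s) {
    g_compares = 0;
    fn(p, s);
    return g_compares;
}
```

With 16 zero-valued items, searching for <code>Item{1}</code> (no match anywhere) costs 16 comparisons in both functions. Searching for <code>Item{0}</code> (every element matches) still costs 16 in <code>count_independent</code> but only 4 in <code>count_short_circuit</code>, which also returns 4 instead of 16. That difference in the result is the "not the same functionality" caveat from above.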
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POIn "loop unrolling", are all unrolled expressions executed?
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USCorbin
UserOwnerUserId
1. USCorbin
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POIn "loop unrolling", are all unrolled expressions executed?
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. COYes. Your theory is right! I have tested it. It seems that we just save the long_runtime() and the branch miss time.
 singulars
 PostPostId
 PO
 UserUserId
 US大宝剑
2. COMy test 100 with {1, 2, 3, 4, 5}: 2.46 0.23 0.15
 singulars
 PostPostId
 PO
 UserUserId
 USDieter Lücking
