<p>Do you mind too much if I talk a bit about profiling, what works and what doesn't?</p>

<p>Let's make up an artificial program, some of whose statements are doing work that can be optimized away - i.e. they are not really necessary. They are "bottlenecks".</p>

<p>Subroutine <code>foo</code> runs a CPU-bound loop that takes one second. Also assume subroutine CALL and RETURN instructions take insignificant or zero time, compared to everything else.</p>

<p>Subroutine <code>bar</code> calls <code>foo</code> 10 times, but 9 of those times are unnecessary, which you don't know in advance and can't tell until your attention is directed there.</p>

<p>Subroutines <code>A</code>, <code>B</code>, <code>C</code>, ..., <code>J</code> are 10 subroutines, and they each call <code>bar</code> once.</p>

<p>The top-level routine <code>main</code> calls each of <code>A</code> through <code>J</code> once.</p>

<p>So the total call tree looks like this:</p>

<pre><code>main
    A
        bar
            foo
            foo
            ... total 10 times for 10 seconds
    B
        bar
            foo
            foo
            ...
    ...
    J
        ...
(finished)
</code></pre>

<p>How long does it all take? 100 seconds, obviously.</p>

<p>Now let's look at profiling strategies. Stack samples (like say 1000 samples) are taken at uniform intervals.</p>

<ol>
<li><p>Is there any self time? Yes. <code>foo</code> takes 100% of the self time. It's a genuine "hot spot". Does that help you find the bottleneck? No, because the bottleneck is not in <code>foo</code>.</p></li>
<li><p>What is the hot path? Well, the stack samples look like this:</p>

<p>main -&gt; A -&gt; bar -&gt; foo (100 samples, or 10%)<br>
main -&gt; B -&gt; bar -&gt; foo (100 samples, or 10%)<br>
...<br>
main -&gt; J -&gt; bar -&gt; foo (100 samples, or 10%)</p></li>
</ol>

<p>There are 10 hot paths, and none of them look big enough to gain you much speedup.</p>

<p>IF YOU HAPPEN TO GUESS, and IF THE PROFILER ALLOWS, you could make <code>bar</code> the "root" of your call tree.
Then you would see this:</p>

<pre><code>bar -&gt; foo (1000 samples, or 100%)
</code></pre>

<p>Then you would know that <code>foo</code> and <code>bar</code> were each independently responsible for 100% of the time and therefore are places to look for optimization. You look at <code>foo</code>, but of course you know the problem isn't there. Then you look at <code>bar</code> and you see the 10 calls to <code>foo</code>, and you see that 9 of them are unnecessary. Problem solved.</p>

<p>IF YOU DIDN'T HAPPEN TO GUESS, and instead the profiler simply showed you the percent of samples containing each routine, you would see this:</p>

<pre><code>main 100%
bar  100%
foo  100%
A     10%
B     10%
...
J     10%
</code></pre>

<p>That tells you to look at <code>main</code>, <code>bar</code>, and <code>foo</code>. You see that <code>main</code> and <code>foo</code> are innocent. You look at where <code>bar</code> calls <code>foo</code> and you see the problem, so it's solved.</p>

<p>It's even clearer if, in addition to showing you the functions, you can be shown the lines where the functions are called. That way, you can find the problem no matter how large the functions are in terms of source text.</p>

<p>NOW, let's change <code>foo</code> so that it does <code>sleep(oneSecond)</code> rather than being CPU bound. How does that change things?</p>

<p>What it means is it still takes 100 seconds by the wall clock, but the CPU time is zero. Sampling in a CPU-only sampler will show <em>nothing</em>.</p>

<p>So now you are told to try instrumentation instead of sampling. Contained among all the things it tells you, it also tells you the percentages shown above, so in this case you could find the problem, assuming <code>bar</code> was not very big.
(There may be reasons to write small functions, but should satisfying the profiler be one of them?)</p>

<p>Actually, the main thing wrong with the sampler was that it can't sample during <code>sleep</code> (or I/O or other blocking), and it doesn't show you code line percents, only function percents.</p>

<p>By the way, 1000 samples gives you nice precise-looking percents. Suppose you took fewer samples. How many do you actually need to find the bottleneck? Well, since the bottleneck is on the stack 90% of the time, if you took only 10 samples, it would be on about 9 of them, so you'd still see it. Even if you took as few as 3 samples, the probability it would appear on two or more of them is 97.2%.**</p>

<p>High sample rates are way overrated when your goal is to find bottlenecks.</p>

<p>Anyway, that's why I rely on <a href="https://stackoverflow.com/questions/375913/what-can-i-use-to-profile-c-code-in-linux/378024#378024">random-pausing</a>.</p>

<p>** How did I get 97.2 percent? Think of it as tossing a coin 3 times, a very unfair coin, where "1" means seeing the bottleneck. There are 8 possibilities:</p>

<pre><code>      #1s  probability
0 0 0   0  0.1^3 * 0.9^0 = 0.001
0 0 1   1  0.1^2 * 0.9^1 = 0.009
0 1 0   1  0.1^2 * 0.9^1 = 0.009
0 1 1   2  0.1^1 * 0.9^2 = 0.081
1 0 0   1  0.1^2 * 0.9^1 = 0.009
1 0 1   2  0.1^1 * 0.9^2 = 0.081
1 1 0   2  0.1^1 * 0.9^2 = 0.081
1 1 1   3  0.1^0 * 0.9^3 = 0.729
</code></pre>

<p>so the probability of seeing it 2 or 3 times is .081*3 + .729 = .972</p>
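<p>If you want to double-check that arithmetic, the table is just a brute-force enumeration of three unfair coin tosses, and a short script (Python here, names purely illustrative) reproduces it:</p>

```python
from itertools import product

# In the toy program, the bottleneck (bar -> foo) is on the stack
# 90% of the wall-clock time, so each random sample "hits" it with p = 0.9.
p_hit = 0.9
n_samples = 3

# Enumerate all 2^3 sample outcomes, exactly as in the table above,
# and add up the probability of outcomes with two or more hits.
prob_two_or_more = 0.0
for outcome in product([0, 1], repeat=n_samples):
    hits = sum(outcome)
    prob = p_hit**hits * (1 - p_hit) ** (n_samples - hits)
    if hits >= 2:
        prob_two_or_more += prob

print(round(prob_two_or_more, 3))  # 0.972
```

<p>The same number falls out of the binomial formula directly: 3(0.9)&sup2;(0.1) + (0.9)&sup3; = 0.243 + 0.729 = 0.972.</p>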
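<p>One more footnote: the "percent of samples containing each routine" view from the example above can also be reproduced mechanically. This sketch builds the 1000 uniform-interval stack samples of the toy program (sampling made deterministic for clarity; the names are the ones from the example) and counts, for each routine, the fraction of samples whose stack contains it:</p>

```python
from collections import Counter

callers = [chr(c) for c in range(ord("A"), ord("J") + 1)]  # A .. J

# Each of A..J accounts for 10 seconds of the 100-second run; at
# 10 samples/second that is 100 samples per caller, and every one of
# those samples shows the stack main -> caller -> bar -> foo.
samples = [("main", caller, "bar", "foo") for caller in callers for _ in range(100)]

# Count samples *containing* each routine (not self time).
on_stack = Counter()
for stack in samples:
    for routine in set(stack):
        on_stack[routine] += 1

for routine in ["main", "bar", "foo"] + callers:
    print(f"{routine:4s} {100 * on_stack[routine] // len(samples):3d}%")
```

<p>This prints 100% for <code>main</code>, <code>bar</code>, and <code>foo</code>, and 10% for each of <code>A</code> through <code>J</code> - exactly the listing that points you at the 10 calls inside <code>bar</code>.</p>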