StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
9915714
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
7
CommunityOwnedDate
CreationDate
2012-03-28T21:11:35.157
FavoriteCount
0
LastActivityDate
2012-03-30T08:28:48.227
LastEditDate
2012-03-30T08:28:48.227
LastEditorUserId
587803
OwnerUserId
587803
ParentId
9913446
PostTypeId
2
Score
1
ViewCount
0
LastEditorDisplayName
text
Body
SQL is easier to understand inside out. <blockquote> The problem is to identify the percentage of times that 'Attribute6' is populated for the set of rows when there are more than 1 value for 'Attribute5' when 'Attibute1' to 'Attribute4' were the same. </blockquote> Breaks down like this: <ol> <li>Where attributes 1-4 are the same</li> <li>And there is more than one value for attribute5</li> <li>Give the percentage of times Attribute6 is populated</li> </ol> Like so: <pre><code>select attribute1, attribute2, attribute3, attribute4, -- 3. give percentage of times Attribute6 is populated -- Percentage is numerator * 100 over denominator -- 3.a. Numerator: Number of times attribute 6 is populated sum( case when attribute6 is null then 0 else 1 end) * 100 / -- 3.b. Denominator: Total number of attribute5 found count(attribute5) from Problem p -- 1. where attributes 1-4 are the same group by attribute1, attribute2, attribute3, attribute4 -- 2. And there is more than one value for attribute5 having count(distinct attribute5) > 1 </code></pre> You haven't been clear on the definiton of "attribute5 has more than one value" - I have assumed you mean more than one distinct value. If you just meant "not null" that is easy too - just replace the count(distinct) with the appropriate expression to get what you want. <h3>Edit</h3> With the added clarity that we are looking for a single number, which is the percentage of groups where there are multiple distinct values of Attribute5, which also have multiple values of attribute6. It's not clear how you want to handle nulls and empty strings, so I am assuming that there are no nulls and empty strings count as a normal value. Try the following: <pre><code>select sum(nDistinct5) as nDemoninator, sum(nDistinct6) as nNumerator, sum(nDistinct6) * 100.0 / sum(nDistinct5) from ( select attribute1, attribute2, attribute3, attribute4, -- 3. give percentage of times Attribute6 is populated -- Percentage is numerator * 100 over denominator -- 3.a. Numerator: Number of times attribute 6 is populated count(distinct attribute6) as nDistinct6, -- 3.b. Denominator: Total number of attribute5 found sum(1) as nDistinct5 from Problem p -- 1. where attributes 1-4 are the same group by attribute1, attribute2, attribute3, attribute4 -- 2. And there is more than one value for attribute5 having count(distinct attribute5) > 1 ) g </code></pre> For eyeballing purposes, join the original data onto the subquery g so you can manually confirm the logic is correct. <pre><code>select p.*, g.nDistinct6, g.nDistinct5 from ( select attribute1, attribute2, attribute3, attribute4, -- 3. give percentage of times Attribute6 is populated -- Percentage is numerator * 100 over denominator -- 3.a. Numerator: Number of times attribute 6 is populated count(distinct attribute6) as nDistinct6, -- 3.b. Denominator: Total number of attribute5 found sum(1) as nDistinct5 from Problem p -- 1. where attributes 1-4 are the same group by attribute1, attribute2, attribute3, attribute4 -- 2. And there is more than one value for attribute5 having count(distinct attribute5) > 1 ) g right outer join Problem p on p.attribute1 = g.attribute1 and p.attribute2 = g.attribute2 and p.attribute3 = g.attribute3 and p.attribute4 = g.attribute4 order by p.attribute1, p.attribute2, p.attribute3, p.attribute4 </code></pre> This displayes every row from <code>Problem</code>, and the corresponding group totals for number of distinct <code>Attribute6</code> and <code>Attribute5</code>, so you can validate that these are indeed the numbers you want to use. If there are too many rows and you just want to eyeball a few hundred, you can use <code>top</code>
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POSQL Server 2008: How to get a Detail ID when doing a multi-column GROUP BY and HAVING
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USBen
UserOwnerUserId
1. USBen
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POSQL Server 2008: How to get a Detail ID when doing a multi-column GROUP BY and HAVING
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.