StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POOptimize SQL subquery containing multiple inner joins and aggregate functions
primarykey
Id
13272553
data
AcceptedAnswerId
13413492
AnswerCount
2
ClosedDate
CommentCount
3
CommunityOwnedDate
CreationDate
2012-11-07T15:15:28.593
FavoriteCount
1
LastActivityDate
2013-10-22T12:18:51.647
LastEditDate
2013-10-22T12:18:51.647
LastEditorUserId
1806414
OwnerUserId
1806414
ParentId
0
PostTypeId
1
Score
1
ViewCount
2165
LastEditorDisplayName
text
Body
I have a select statement which is infact a subquery within a larger select statement built up programmatically. The problem is if I elect to include this subquery it acts as a bottle neck and the whole query becomes painfully slow. An example of the data is as follows: <pre><code>Payment .Receipt_no|.Person |.Payment_date|.Type|.Reversed| 2|John |01/02/2001 |PA | | 1|John |01/02/2001 |GX | | 3|David |15/04/2003 |PA | | 6|Mike |26/07/2002 |PA |R | 5|John |01/01/2001 |PA | | 4|Mike |13/05/2000 |GX | | 8|Mike |27/11/2004 |PA | | 7|David |05/12/2003 |PA |R | 9|David |15/04/2003 |PA | | </code></pre> The subquery is as follows : <pre><code>select Payment.Person, Payment.amount from Payment inner join (Select min([min_Receipt].Person) 'Person', min([min_Receipt].Receipt_no) 'Receipt_no' from Payment [min_Receipt] inner join (select min(Person) 'Person', min(Payment_date) 'Payment_date' from Payment where Payment.reversed != 'R' and Payment.Type != 'GX' group by Payment.Person) [min_date] on [min_date].Person= [min_Receipt].Person and [min_date].Payment_date = [min_Receipt].Payment_date where [min_Receipt].reversed != 'R' and [min_Receipt].Type != 'GX' group by [min_Receipt].Person) [1stPayment] on [1stPayment].Receipt_no = Payment.Receipt_no </code></pre> This retrieves the first payment of each person by .Payment_date (ascending), .Receipt_no (ascending) where .type is not 'GX' and .Reversed is not 'R'. As Follows: <pre><code>Payment .Receipt_No|.Person|.Payment_date 5|John |01/01/2001 3|David |15/04/2003 8|Mike |27/11/2004 </code></pre> <h2>Following Ahmads post -</h2> From the following results <pre><code>(3|David |15/04/2003) and (9|David |15/04/2003) </code></pre> I would only want the record with the lowest receipt_no. So <pre><code>(3|David |15/04/2003) </code></pre> So I added the aggregate function 'min(Payment.receipt_no)' grouping by person. Query 1. <pre><code>select min(Payment.Person) 'Person', min(Payment.receipt_no) 'receipt_no' from Payment a where a.type<>'GX' and (a.reversed not in ('R') or a.reversed is null) and a.payment_date = (select min(payment_date) from Payment i where i.Person=a.Person and i.type <> 'GX' and (i.reversed not in ('R') or i.reversed is null)) group by a.Person </code></pre> I added this as a subquery within my much larger query, however it still ran very slowly. So I tried rewriting the query whilst trying to avoid the use of aggregate functions and came up with the following. Query 2. <pre><code>SELECT receipt_no, person, payment_date, amount FROM payment a WHERE receipt_no IN (SELECT top 1 i.receipt_no FROM payment i WHERE (i.reversed NOT IN ('R') OR i.reversed IS NULL) AND i.type<>'GX' AND i.person = a.person ORDER BY i.payment_date DESC, i.receipt_no ASC) </code></pre> Which I wouldn't necessarily think as more efficient. In fact if I run the two queries side by side on my larger data set Query 1. completes in a matter of milliseconds where as Query 2. takes several seconds. However if I then add them as subqueries within a much larger query, the larger query completes in hours using Query 1. and completes in 40 seconds using Query 2. I can only attribute this to the use of aggregate functions in one and not the other.
Tags
<sql><sql-server><optimization><group-by><aggregate-functions>
Title
Optimize SQL subquery containing multiple inner joins and aggregate functions
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USDMK
UserOwnerUserId
1. USDMK
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POOptimize SQL subquery containing multiple inner joins and aggregate functions
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POOptimize SQL subquery containing multiple inner joins and aggregate functions
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTApproveEditSuggestion
3. VO
 singulars
 PostPostId
 POOptimize SQL subquery containing multiple inner joins and aggregate functions
 UserUserId
 USDMK
 VoteTypeVoteTypeId
 VTFavorite
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.