StackOverflow2013


plurals

POBasic set-based operations using a document database (noSQL)
primarykey
Id
6712263
data
AcceptedAnswerId
6745580
AnswerCount
2
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2011-07-15T19:34:52.047
FavoriteCount
1
LastActivityDate
2011-07-19T10:16:32.707
LastEditDate
2011-07-15T19:40:55.513
LastEditorUserId
324240
OwnerUserId
324240
ParentId
0
PostTypeId
1
Score
3
ViewCount
629
LastEditorDisplayName
text
Body
As with most, I come from an RDBMS world and am trying to get my head around noSQL databases, specifically document stores (as I find them the most interesting). I am trying to understand how to perform some set-based operations using a document database (I'm playing with RavenDB). So as per my understanding: <ul> <li>Union (as in SQL UNION) is a very straightforward append. Additionally, unions between different sets (SQL JOIN) can be achieved with map/reduce. The example given in the RavenDB mythology book with comment counts on blog entries is a good start.</li> <li>Intersection can be performed using a number of techniques, from de-normalization right through to creating a “mapping” or “link” document as described <a href="http://groups.google.com/group/ravendb/browse_thread/thread/915e11613c58f738/22a45073297d588d?pli=1" rel="nofollow">here</a> (and the aggregator example below). In an RDBMS this would be performed using a simple "INNER JOIN" or "WHERE x IN".</li> <li>Subtract (relative complement) is where I am getting stuck. In an RDBMS this operation is simply a "WHERE x NOT IN" or a "LEFT JOIN" where the joined set is NULL.</li> </ul> Using a real-world example, let’s say we have an RSS aggregator (such as Google Reader) which has millions if not billions of RSS entries and thousands of users, each tagging entries as favourite, etc. In this example we focus on entry, user and tag, where tag acts as a link between user and entry. <pre><code>user {string id, string name /*etc.*/} entry {string id, string title, string url /*etc.*/} tag {string userId, string entryId, string[] tags} /* (favourite, read, etc.)*/ </code></pre> With the above approach it is easy to perform the intersection between entry and user using tag. But I cannot get my head around how one would perform a subtract. For instance, “return all items that do not have any tags” or, even more daunting, “return the latest 1000 items without any tag”. 
So my questions: <ul> <li>Can you point me to some reading material on the matter?</li> <li>Can you share some ideas on how one can accomplish the task efficiently?</li> </ul> Note: I know that you lose query flexibility with document databases, but surely there must be a way to do this?
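To make the relative-complement query concrete, here is a minimal in-memory sketch in plain Python. It is not RavenDB code and says nothing about how a document store would index this efficiently; it only spells out what “return the latest 1000 items without any tag” means for the entry/tag model above. The names `Entry`, `Tag`, `untagged_entries` and the `created` field are hypothetical, and the sketch assumes that a tag document with an empty `tags` array does not count as tagging the entry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    id: str
    title: str
    created: int  # hypothetical sortable timestamp, not in the original model


@dataclass(frozen=True)
class Tag:
    user_id: str
    entry_id: str
    tags: tuple  # e.g. ("favourite", "read")


def untagged_entries(entries, tags, limit=None):
    """Relative complement: entries whose id appears in no non-empty tag document."""
    # Collect the ids of every entry that carries at least one tag.
    tagged_ids = {t.entry_id for t in tags if t.tags}
    # Keep only the entries outside that set (the SQL "WHERE x NOT IN" step).
    result = [e for e in entries if e.id not in tagged_ids]
    # Newest first, so that a limit yields "the latest N".
    result.sort(key=lambda e: e.created, reverse=True)
    return result if limit is None else result[:limit]


entries = [
    Entry("e1", "First", 1),
    Entry("e2", "Second", 2),
    Entry("e3", "Third", 3),
]
tags = [Tag("u1", "e2", ("favourite",))]

latest_untagged = untagged_entries(entries, tags, limit=1000)
print([e.id for e in latest_untagged])
```

The sketch builds the full set of tagged ids before filtering, which is exactly the step that becomes expensive at the scale described in the question; a denormalized flag or tag count stored on each entry would be one way to avoid it, at the cost of extra writes.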
Tags
<nosql><ravendb><except><complement><rdms>
Title
Basic set-based operations using a document database (noSQL)
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USamok
UserOwnerUserId
1. USamok
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POBasic set-based operations using a document database (noSQL)
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POBasic set-based operations using a document database (noSQL)
 UserUserId
 USRohland
 VoteTypeVoteTypeId
 VTFavorite
3. VO
 singulars
 PostPostId
 POBasic set-based operations using a document database (noSQL)
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. This table or related slice is empty.
