StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
11404521
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
8
CommunityOwnedDate
CreationDate
2012-07-09T23:09:31.410
FavoriteCount
0
LastActivityDate
2012-07-11T03:38:29.207
LastEditDate
2017-05-23T11:45:44.440
LastEditorUserId
-1
OwnerUserId
1354190
ParentId
11336950
PostTypeId
2
Score
1
ViewCount
0
LastEditorDisplayName
text
Body
You probably need to use <a href="https://cwiki.apache.org/Hive/languagemanual-transform.html" rel="nofollow noreferrer">Hive transform functionality</a> and have a custom reducer that does the matching between the records from the two tables: t1 and t2 where t1 is simply TestingTable1 and t2 is <pre><code> SELECT user_id, prod_and_ts.product_id as product_id, prod_and_ts.timestamps as timestamps FROM TestingTable2 LATERAL VIEW explode(purchased_item) exploded_table as prod_and_ts </code></pre> <a href="https://stackoverflow.com/questions/11373543/explode-the-array-of-struct-in-hive#comment15007330_11373543">as explained by me in another question of yours</a>. <pre><code>FROM ( FROM ( SELECT buyer_id, item_id, created_time, id FROM ( SELECT buyer_id, item_id, created_time, 't1' as id FROM TestingTable1 t1 UNION ALL SELECT user_id as buyer_id, prod_and_ts.product_id as item_id, prod_and_ts.timestamps as created_time, 't2' as id FROM TestingTable2 LATERAL VIEW explode(purchased_item) exploded_table as prod_and_ts )t )x MAP buyer_id, item_id, created_time, id USING '/bin/cat' AS buyer_id, item_id, create_time, id CLUSTER BY buyer_id ) map_output REDUCE buyer_id, item_id, create_time, id USING 'my_custom_reducer' AS buyer_id, item_id, create_time, product_id, timestamps; </code></pre> The above query has 2 distinct portions. The first part is "MAP" and the other is "REDUCE". In between these 2 parts is a phase called shuffle (represented by <code>CLUSTER BY buyer_id</code>) that is automatically taken care of my Hive. The Map part of the query reads from tables and also passes an identifier (called id that represents which tables the record is coming from). The Shuffle phase groups all the records per buyer_id. The Reduce phase will take in the all records for a given buyer_id and emit out only the records that satisfy the matching criteria. You will have to write the reducer yourself based on your matching criteria. You can write it in any language of your choice. It's guaranteed that all records that have the same buyer_id will go to the same reducer script. There might be an easier way to do but this is the method I can think of right now. Good luck! To gain further appreciation of why I chose this method, <a href="https://stackoverflow.com/questions/11387543/performance-tuning-a-hive-query/11405841#11405841">see my recent answer here</a>.
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POJoining two Tables in Hive using HiveQL(Hadoop)
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USCommunity
UserOwnerUserId
1. USMark Grover
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUndeletion
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.