StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POhadoop key returns multiple values
primarykey
Id
20501302
data
AcceptedAnswerId
0
AnswerCount
1
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2013-12-10T17:31:47.407
FavoriteCount
0
LastActivityDate
2013-12-10T18:36:30.880
LastEditDate
LastEditorUserId
0
OwnerUserId
3087780
ParentId
0
PostTypeId
1
Score
0
ViewCount
614
LastEditorDisplayName
text
Body
This is my first time programming in Hadoop and I'm basing my assignment off of WordCount v1.0 on the hadoop tutorial website. Assignment: You have two files. File0 contains every word in the dictionary. File1 contains one random word. For example: 'beautiful'. And when I run the program it should return every word in File0 that is the same size as the word in File1 *beautiful will return every 9 letter word in the dictionary For example: beautiful - AARDVARKS AASVOGELS ABAMPERES....ZYMOGRAMS ZYMURGIES So my question is how should I go about this? The hadoop wordcount v1.0 returns the key and a single value. ---- e.g. (beautiful 4) Do I need to change the value from an int to a string or maybe some sort of an array that contains every word of the same size as the key? *basically I need to change the format from (beautiful 4) to (beautiful: AARDVARKS AASVOGELS ABAMPERES...ZYMOGENES ZYMOGRAMS ZYMURGIES) Here is the code (from their website): <pre><code>package org.myorg; import java.io.IOException; import java.util.*; import org.apache.hadoop.fs.Path; import org.apache.hadoop.conf.*; import org.apache.hadoop.io.*; import org.apache.hadoop.mapred.*; import org.apache.hadoop.util.*; public class WordCount { public static class Map extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> { private final static IntWritable one = new IntWritable(1); private Text word = new Text(); public void map(LongWritable key, Text value, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { String line = value.toString(); StringTokenizer tokenizer = new StringTokenizer(line); while (tokenizer.hasMoreTokens()) { word.set(tokenizer.nextToken()); output.collect(word, one); } } } public static class Reduce extends MapReduceBase implements Reducer<Text, IntWritable, Text, IntWritable> { public void reduce(Text key, Iterator<IntWritable> values, OutputCollector<Text, IntWritable> output, Reporter reporter) throws IOException { int sum = 0; while (values.hasNext()) { sum += values.next().get(); } output.collect(key, new IntWritable(sum)); } } public static void main(String[] args) throws Exception { JobConf conf = new JobConf(WordCount.class); conf.setJobName("wordcount"); conf.setOutputKeyClass(Text.class); conf.setOutputValueClass(IntWritable.class); conf.setMapperClass(Map.class); conf.setCombinerClass(Reduce.class); conf.setReducerClass(Reduce.class); conf.setInputFormat(TextInputFormat.class); conf.setOutputFormat(TextOutputFormat.class); FileInputFormat.setInputPaths(conf, new Path(args[0])); FileOutputFormat.setOutputPath(conf, new Path(args[1])); JobClient.runJob(conf); } } </code></pre> Do I need to change the map, reduce, or both?? And how?? can someone please help! thanks so much
Tags
<java><hadoop><cloudera>
Title
hadoop key returns multiple values
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USuser3087780
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. This table or related slice is empty.
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.