1. Launch a MapReduce job from Eclipse
I've written a MapReduce program in Java which I can submit to a remote cluster running in distributed mode. Currently, I submit the job using the following steps:

1. Export the MapReduce job as a jar (e.g. `myMRjob.jar`)
2. Submit the job to the remote cluster using the shell command: `hadoop jar myMRjob.jar`

I would like to submit the job directly from Eclipse when I run the program. How can I do this?

I am currently using CDH3, and an abridged version of my conf is:

```java
conf.set("hbase.zookeeper.quorum", getZookeeperServers());
conf.set("fs.default.name", "hdfs://namenode/");
conf.set("mapred.job.tracker", "jobtracker:jtPort");

Job job = new Job(conf, "COUNT ROWS");
job.setJarByClass(CountRows.class);

// Set up Mapper
TableMapReduceUtil.initTableMapperJob(inputTable, scan,
    CountRows.MyMapper.class, ImmutableBytesWritable.class,
    ImmutableBytesWritable.class, job);

// Set up Reducer
job.setReducerClass(CountRows.MyReducer.class);
job.setNumReduceTasks(16);

// Set up overall output
job.setOutputFormatClass(MultiTableOutputFormat.class);

job.submit();
```

When I run this directly from Eclipse, the job is launched but Hadoop cannot find the mappers/reducers. I get the following errors:

```
12/06/27 23:23:29 INFO mapred.JobClient:  map 0% reduce 0%
12/06/27 23:23:37 INFO mapred.JobClient: Task Id : attempt_201206152147_0645_m_000000_0, Status : FAILED
java.lang.RuntimeException: java.lang.ClassNotFoundException: com.mypkg.mapreduce.CountRows$MyMapper
    at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:996)
    at org.apache.hadoop.mapreduce.JobContext.getMapperClass(JobContext.java:212)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:602)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
    at org.apache.hadoop.mapred.Child.main(Child.java:264)
    ...
```

Does anyone know how to get past these errors? If I can fix this, I can integrate more MR jobs into my scripts, which would be awesome!
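For context, a `ClassNotFoundException` like this typically means nothing told Hadoop which jar to ship to the cluster: `hadoop jar myMRjob.jar` supplies that jar implicitly, but a plain Eclipse launch only has the classes in the project's `bin/` directory. The sketch below is a minimal illustration of one commonly suggested workaround, not a verified fix for this exact setup. It assumes a jar has already been built (the class name `LaunchFromIde` and the path `/path/to/myMRjob.jar` are placeholders) and points the configuration at it via the MRv1-era `mapred.jar` property (the key that `JobConf.setJar()` writes) before submitting.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class LaunchFromIde {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.default.name", "hdfs://namenode/");      // as in the question
        conf.set("mapred.job.tracker", "jobtracker:jtPort");  // as in the question

        // Tell the framework which jar to ship to the TaskTrackers. "mapred.jar"
        // is the MRv1-era key; the path below is a hypothetical location of a jar
        // built ahead of time (e.g. via an Eclipse export or an Ant/Maven step).
        conf.set("mapred.jar", "/path/to/myMRjob.jar");

        Job job = new Job(conf, "COUNT ROWS");
        // setJarByClass() only helps when the class is loaded from a jar already on
        // the classpath; when launched from Eclipse the classes come from bin/, so
        // the explicit jar path above is what the remote tasks actually use.
        // ... mapper/reducer/output setup would go here, as in the question ...
        job.submit();
    }
}
```

In other words, the jar still has to be built before each run; the difference is that the submitting code itself names it, so the Eclipse launch no longer depends on the `hadoop jar` wrapper.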