Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>First, to use Hadoop with Python (whenever you run it on your own cluster, or Amazon EMR, or anything else) you would need an option called "Hadoop Streaming".</p> <p>Read the <a href="http://hadoop.apache.org/common/docs/r0.15.2/streaming.html" rel="nofollow noreferrer">original chapter</a> (<a href="http://hadoop.apache.org/docs/r0.18.0/commands_manual.html" rel="nofollow noreferrer">updated link</a>) of Hadoop Manual to get the idea of how it works.</p> <p>There is also a <a href="http://packages.python.org/mrjob/#table-of-contents" rel="nofollow noreferrer">great library</a> "MrJob" that simplifies running Python jobs on Hadoop. </p> <p>You could set up your own cluster or try to play with Amazon Elastic Map Reduce. The later can cost you something, but sometimes easier to run at the beginning. There is a <a href="http://aws.amazon.com/articles/2294" rel="nofollow noreferrer">great tutorial</a> on how to run Python with Hadoop Streaming on Amazon EMR. It immediately shows a simple but practical application.</p> <p>To learn the Hadoop itself I would recommend reading one of the books out there. They say that <a href="http://rads.stackoverflow.com/amzn/click/1935182196" rel="nofollow noreferrer">"Hadoop In Action"</a> is better in covering things for those who interested in Python/Hadoop Streaming. </p> <p>Also note that for testing/learning things you can run Hadoop on your local machine without having an actual cluster.</p> <p>UPDATE:</p> <p>As for understanding Map Reduce (that is how to identify and express different kinds of problems on Map Reduce language) read the great article "<a href="http://highlyscalable.wordpress.com/2012/02/01/mapreduce-patterns/" rel="nofollow noreferrer">MapReduce Patterns, Algorithms, and Use Cases</a>" with examples in Python.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    2. VO
      singulars
      1. This table or related slice is empty.
    3. VO
      singulars
      1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload