Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p><strong>There's a ready-to-use solution from Facebook - <a href="https://github.com/facebook/scribe" rel="nofollow">Scribe</a></strong> - that is using Apache Hadoop under the hood. However, most companies I'm aware of still tend to develop in-house systems for that. I worked in one such company and dealt with logs there about two years ago. We also used Hadoop. In our case we had the following setup:</p> <ul> <li>We had a small dedicated cluster of machines for log aggregation.</li> <li>Workers mined logs from production service and then parse individual lines.</li> <li>Then reducers would aggregate the necessary data and prepare reports.</li> </ul> <p>We had a small and fixed number of reports that we were interested in. In rare cases when we wanted to perform a different kind of analysis we would simply add a specialized reducer code for that and optionally run it against old logs.</p> <p>If you can't decide what kind of analyses you are interested in in advance then it'll be better to store structured data prepared by workers in HBase or some other NoSQL database (<a href="http://www.cubrid.org/blog/dev-platform/log-analysis-system-using-hadoop-and-mongodb/" rel="nofollow">here, for example, people use Mongo DB</a>). That way you won't need to re-aggregate data from the raw logs and will be able to query the datastore instead.</p> <p>There are a number of good articles about such logging aggregation solutions, for example, <a href="http://www.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/" rel="nofollow">using Pig to query the aggregated data</a>. <a href="http://pig.apache.org" rel="nofollow">Pig</a> lets you query large Hadoop-based datasets with SQL-like queries.</p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload