Note that there are some explanatory texts on larger screens.

plurals
  1. POnosql: MongoDB, Cassandra or alternative for data warehousing
    text
    copied!<p>I am stuck between a concrete decision on whether to go with MongoDB or Cassandra for my database needs and would like input on my use case as to guide my decision.</p> <p><em>Requirements:</em></p> <p>Data source </p> <ul> <li>X datacenters containing Y servers. </li> <li>Each server has N networks and M statistics.</li> </ul> <p>e.g. Currently ( 3 datacenters, 50 total servers, 19 networks and 10 stats ). These numbers will increase over time.</p> <p><em>Data fetching:</em></p> <ul> <li>Parse an xml page for each server (~20kb / page ) every hour. (~25mb / day )</li> </ul> <p><em>Data storage:</em></p> <ul> <li>Organized (hourly,daily, monthly) structure using aggregation to find higher values (hours -> day ) </li> </ul> <p><em>Note:</em> We need the ability to:</p> <ul> <li>Dynamically add / remove values ( datacenters / servers / networks / statistics ) and scale-ability is a key issue, hence we are moving from SQL over to NoSQL. </li> <li>Reliability is also a high priority ( master / slave, no corruption ) and will require an "easy" maintainability.</li> <li>Writing is hourly, no need for "massive" writing performance.</li> </ul> <p>Example use case: On the front-end you will query like so, select; date window, period report, specific datacenter, specific/all networks, specific/all statistics and whether results are totalled or individual across the servers.</p> <pre><code>Example #1 - From: August 16th 2012 -&gt; April 16th 2013 - Period: Daily - Data-center: EU A - Stat-type: Error - Servers: All </code></pre> <p>From reading similar articles across stack-overflow and the web, I've come to the conclusion that my best bet may be MongoDB for its flexible queries and closeness to a relational database. Cassandra seems like an option if my writes were of higher volumes - although I do like the column based model. I am a novice to database design and management so ease of use is also a factor (still a CS student).</p> <p>From my use cases which NoSql database is the best option?</p>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload