Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>Just as a quick check on the number of file handlers you have / are used try this (unix):</p> <pre><code>cat /proc/sys/fs/file-nr </code></pre> <p>That will give you (number of allocated file handlers) - (number of allocated but unused file handlers) - (maximum number of file handlers) --- see <a href="https://stackoverflow.com/questions/10369309/opendir-too-many-open-files">here</a>.</p> <p>It is possible to change the limits in <em>sysctl.conf</em> (on linux - see <a href="https://stackoverflow.com/questions/10369309/opendir-too-many-open-files">link</a> above) - but this is probably not a good idea resource management-wise and, therefore, isn't really scalable. And, yes, things start to get slower as more handlers get used for opening each file (as they don't get closed until after the shell execution stops/ends) and eventually it fails when there are no more handlers available.</p> <p>One potential solution could incorporate <a href="http://www.scipy.org/" rel="nofollow noreferrer">Python/SciPy/Pandas</a> and a simple database. There's great documentation and large community support. An example that is closely related to your post is <a href="https://stackoverflow.com/questions/11174367/averaging-data-from-multiple-data-files-in-python-with-pandas?rq=1">here</a>. A small post on interfacing Pandas and a database linked <a href="https://stackoverflow.com/questions/10065051/python-pandas-and-databases-like-mysql">here</a>.</p> <p>I haven't tried this but I'll give it a shot:</p> <p>For the database you could use something like the pandas <code>io.sql</code> module to create a useful representation of each dat file (maybe using the A# header as an identifier for each table). Then the data can be manipulated by any number of methods e.g. <a href="https://stackoverflow.com/questions/11174367/averaging-data-from-multiple-data-files-in-python-with-pandas?rq=1">glued</a>. This does not retain the <code>./merge.sh data* &gt; dataComb.dat</code> functionality that you ask for but a simple python command line script could potentially handle all the steps to get data in and processed as you wish.</p> <p>I think it would be quite the learning curve but it could pay off in scalability/flexibility for the future.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload