StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POAligning data table created from perl hash
primarykey
Id
7639608
data
AcceptedAnswerId
7643488
AnswerCount
3
ClosedDate
CommentCount
3
CommunityOwnedDate
CreationDate
2011-10-03T19:20:11.073
FavoriteCount
1
LastActivityDate
2011-10-04T04:59:29.797
LastEditDate
2011-10-03T21:18:27.823
LastEditorUserId
825160
OwnerUserId
825160
ParentId
0
PostTypeId
1
Score
1
ViewCount
649
LastEditorDisplayName
text
Body
I'm trying to write a script to process output from behavioral testing equipment. I need to have all data aligned by timestamp in the resulting CSV file. Here's the catch: the start time differs between test runs (it's close, but not exact - can be off by a few seconds to several minutes). I can get the output I want, and I think I have a good idea as to how I can align all variables, but don't know how to implement it. All data is in a hash with two levels ( %hash{id}{vars} ) with all variables stored as a number to keep things simple (variable names are read from an array on printout). Once all data has been scraped from the input files, the script walks through the hash and prints out data as follows: <pre><code>Variable 1 ID #1 data1 data2 data3... ID #2 data1 data2 data3... ... Variable 2 ... </code></pre> and so on. These are 24 h recordings. The last datapoint (var=20) for all subjects is light: data reads either "ON" or "OFF" for day and night. The best method of alignment I can see is to use the light OFF marker to align data. My thinking is as follows: 1. Find first position for each ID for which var '20' = 'OFF' and record position 2. Figure out which ID has the greatest position for OFF (ie, the one that started recording earliest) 3. Add empty value pairs to every other subject until OFF position is the same for all. For example, if data is recorded once per minute and one subject has an OFF time that is 5 minutes later than all others, add 5 empty data points to all other subjects to align the data. This would have to be done for all datapoints for each subject, not just the lights on/off measure. Would this approach work? And if so, how could I implement this? **Note that I need to be able to package this as a standalone script to run on multiple computers, so I can't count on perl modules that aren't installed by default. --edit per request: example. Input data looks like this (it's a CSV file) <pre><code>ID, TIME, DATA1, DATA2, DATA3, [...] , LIGHT Subj1, 10:00:00, data1, data2, data3, [...] , ON Subj1, 10:00:30, data1, data2, data3, [...] , ON Subj1, 10:01:00, data1, data2, data3, [...] , OFF Subj1, 10:01:00, data1, data2, data3, [...] , OFF </code></pre> For another subject, data might look like this: <pre><code>ID, TIME, DATA1, DATA2, DATA3, [...] , LIGHT Subj2, 09:59:27, data1, data2, data3, [...] , ON Subj2, 09:59:57, data1, data2, data3, [...] , ON Subj2, 10:00:27, data1, data2, data3, [...] , ON Subj2, 10:00:57, data1, data2, data3, [...] , OFF Subj2, 10:01:27, data1, data2, data3, [...] , OFF </code></pre> Script takes each line from all files and adds them to a hash keyed by ID, with one level for each data column keyed by column number. For these two files hash would look like this: <pre><code>$VAR1 = { 'Subj1' => { '1' => [ data1 data1 ... ] '2' => [ data2 data2 ... ] ... '20' => [ ON ON ... } 'Subj1' => { '1' => [ data1 data1 ... ] '2' => [ data2 data2 ... ] ... '20' => [ ON ON ... } }; </code></pre> Data is output with a foreach loop: <pre><code>foreach my $k (sort {$a cmp $b} keys %data) { print OUT $k, "\,"; foreach my $d ( @{ $data{$k}{$i} } ) { print OUT $d, "\,"; } print OUT "\n"; } </code></pre> Output looks like this: <pre><code>TIME Subj1, 10:00:00, 10:00:30, 10:01:00, 10:01:30, Subj2, 09:59:27, 09:59:57, 10:00:27, 10:00:57, 10:01:27, DATA1 Subj1, data1, data1, data1, data1, data1, Subj2, data2, data2, data2, data2, data2, data2, [ ... all other data ... ] LIGHT Subj1, ON, ON, OFF, OFF, Subj2, ON, ON, ON, OFF, OFF, </code></pre> What I need to do is align all data by the ON/OFF columns in LIGHT, by adding empty values like so: <pre><code>TIME Subj1, , 10:00:00, 10:00:30, 10:01:00, 10:01:30, Subj2, 09:59:27, 09:59:57, 10:00:27, 10:00:57, 10:01:27, DATA1 Subj1, , data1, data1, data1, data1, data1, Subj2, data2, data2, data2, data2, data2, data2, [ ... all other data ... ] LIGHT Subj1, , ON, ON, OFF, OFF, Subj2, ON, ON, ON, OFF, OFF, </code></pre> Trying to figure out how best to do this. Sorry this is long...
Tags
<perl><hashtable><alignment>
Title
Aligning data table created from perl hash
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USdr.nixon
UserOwnerUserId
1. USdr.nixon
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
3. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POAligning data table created from perl hash
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POAligning data table created from perl hash
 UserUserId
 USindiguy
 VoteTypeVoteTypeId
 VTFavorite
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.