StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POExtracting data from a string where the data structure is embedded in the string itself
primarykey
Id
8759083
data
AcceptedAnswerId
8762059
AnswerCount
2
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2012-01-06T14:10:10.843
FavoriteCount
0
LastActivityDate
2012-01-06T17:42:45.427
LastEditDate
LastEditorUserId
0
OwnerUserId
1134087
ParentId
0
PostTypeId
1
Score
2
ViewCount
326
LastEditorDisplayName
text
Body
In a project we are doing we encounter log files of which each line has the following structure: 2012-01-02,12:50:32,658,2,1,2,0,0,0,0,1556,1555,62,60,2,3,0,0,0,0,1559,1557,1557,63,64,65,0.305,0.265,0.304,0.308,0.309 The structure of the string is embedded in the string itself. First we have some metadata: <ul> <li>date: 2012/01/02</li> <li>time: 12:50:32</li> <li>measurement number: 658</li> <li>number of measurement groups: 2</li> </ul> This is then followed by the data of each group sequentially. <ul> <li>Measurement group 1: 1,2,0,0,0,0,1556,1555,62,60</li> <li>Measurement group 2: 2,3,0,0,0,0,1559,1557,1557,63,64,65</li> </ul> Group data has the following structure (measurement group 1 used below as an example): <ul> <li>number of the measurement group:1</li> <li>number of sensors in this group:2</li> <li>control field 1 to 4 (0 most of the time):0,0,0,0</li> <li>raw values of type 1 for each sensor (>1500 in the examples):1556,1555</li> <li>raw values of type 2 for each sensor (~60 in the examples),62,60</li> </ul> The line continues with the calculated values for all sensors mentioned above consecutively (i.e. no more control values, or raw values) In the example, the total number of sensors = 2 + 3 = 5 so the calculated line is: 0.305,0.265,0.304,0.308,0.309 My question is this: If we want to normalize the values for each sensor like this: date, time, number of measurement group, number sensor in group, (raw value type 1, raw value type 2, calculated value) What would be a flexible solution, given that a any date-time each variable is well... variable (meaning that the number of measurement group is variable, and the number of sensors in each group can also be variable? For the example final output should be something like: <ul> <li>2012/01/02,12:50:32,1,1,(1556,62,0.305)</li> <li>2012/01/02,12:50:32,1,2,(1555,60,0.265)</li> <li>2012/01/02,12:50:32,2,1,(1559,63,0.304)</li> <li>2012/01/02,12:50:32,2,2,(1557,64,0.308)</li> <li>2012/01/02,12:50:32,2,3,(1557,65,0.309)</li> </ul> What I did up to now is to segment the measurement into cases over time and define "statically" which columns are to be inserted for a line belonging to a case, which group a sensor belongs to, what its sensornumber is,... This is hardly a good solution as each change in the measurement setup results in more changes to the code. <pre><code>line="""2012-01-02,12:50:32,658,2,1,2,0,0,0,0,1556,1555,62,60,2,3,0,0,0,0,1559,1557,1557,63,64,65,0.305,0.265,0.304,0.308,0.309""" parts=line.split(",") date=parts[0] groupnames=[1,1,2,2,2] sensornumbers=[1,2,1,2,3] raw_type1_idx=[10,11,20,21,22] raw_type2_idx=[12,13,23,24,25] calc_idx=[26,27,28,29,30] for i,j,k,l,m in zip(groupnames,sensornumbers,raw_type1_idx,raw_type2_idx,calc_idx): output_tpl= parts[k],parts[l],parts[m] print "%s,%s,%s,%s" % (date,i,j,output_tpl) </code></pre> Is there a better Python way of doing stuff like this?
Tags
<python>
Title
Extracting data from a string where the data structure is embedded in the string itself
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USSpeediro
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POExtracting data from a string where the data structure is embedded in the string itself
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.