StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POData structure for maintaining tabular data in memory?
primarykey
Id
1038160
data
AcceptedAnswerId
1038203
AnswerCount
6
ClosedDate
CommentCount
1
CommunityOwnedDate
CreationDate
2009-06-24T12:48:51.077
FavoriteCount
35
LastActivityDate
2016-09-28T22:35:58.497
LastEditDate
2016-09-28T22:35:58.497
LastEditorUserId
4370109
OwnerUserId
103532
ParentId
0
PostTypeId
1
Score
61
ViewCount
103645
LastEditorDisplayName
text
Body
My scenario is as follows: I have a table of data (handful of fields, less than a hundred rows) that I use extensively in my program. I also need this data to be persistent, so I save it as a CSV and load it on start-up. I choose not to use a database because every option (even SQLite) is an overkill for my humble requirement (also - I would like to be able to edit the values offline in a simple way, and nothing is simpler than notepad). Assume my data looks as follows (in the file it's comma separated without titles, this is just an illustration): <pre><code> Row | Name | Year | Priority ------------------------------------ 1 | Cat | 1998 | 1 2 | Fish | 1998 | 2 3 | Dog | 1999 | 1 4 | Aardvark | 2000 | 1 5 | Wallaby | 2000 | 1 6 | Zebra | 2001 | 3 </code></pre> Notes: <ol> <li>Row may be a "real" value written to the file or just an auto-generated value that represents the row number. Either way it exists in memory.</li> <li>Names are unique.</li> </ol> Things I do with the data: <ol> <li>Look-up a row based on either ID (iteration) or name (direct access).</li> <li>Display the table in different orders based on multiple field: I need to sort it e.g. by Priority and then Year, or Year and then Priority, etc.</li> <li>I need to count instances based on sets of parameters, e.g. how many rows have their year between 1997 and 2002, or how many rows are in 1998 and priority > 2, etc.</li> </ol> I know this "cries" for SQL... I'm trying to figure out what's the best choice for data structure. Following are several choices I see: List of row lists: <pre><code>a = [] a.append( [1, "Cat", 1998, 1] ) a.append( [2, "Fish", 1998, 2] ) a.append( [3, "Dog", 1999, 1] ) ... </code></pre> List of column lists (there will obviously be an API for add_row etc): <pre><code>a = [] a.append( [1, 2, 3, 4, 5, 6] ) a.append( ["Cat", "Fish", "Dog", "Aardvark", "Wallaby", "Zebra"] ) a.append( [1998, 1998, 1999, 2000, 2000, 2001] ) a.append( [1, 2, 1, 1, 1, 3] ) </code></pre> Dictionary of columns lists (constants can be created to replace the string keys): <pre><code>a = {} a['ID'] = [1, 2, 3, 4, 5, 6] a['Name'] = ["Cat", "Fish", "Dog", "Aardvark", "Wallaby", "Zebra"] a['Year'] = [1998, 1998, 1999, 2000, 2000, 2001] a['Priority'] = [1, 2, 1, 1, 1, 3] </code></pre> Dictionary with keys being tuples of (Row, Field): <pre><code>Create constants to avoid string searching NAME=1 YEAR=2 PRIORITY=3 a={} a[(1, NAME)] = "Cat" a[(1, YEAR)] = 1998 a[(1, PRIORITY)] = 1 a[(2, NAME)] = "Fish" a[(2, YEAR)] = 1998 a[(2, PRIORITY)] = 2 ... </code></pre> And I'm sure there are other ways... However each way has disadvantages when it comes to my requirements (complex ordering and counting). What's the recommended approach? EDIT: To clarify, performance is not a major issue for me. Because the table is so small, I believe almost every operation will be in the range of milliseconds, which is not a concern for my application.
Tags
<python><data-structures>
Title
Data structure for maintaining tabular data in memory?
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USRoee Adler
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
3. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POData structure for maintaining tabular data in memory?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POData structure for maintaining tabular data in memory?
 UserUserId
 USnikow
 VoteTypeVoteTypeId
 VTFavorite
3. VO
 singulars
 PostPostId
 POData structure for maintaining tabular data in memory?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.