StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
620568
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
7
CommunityOwnedDate
CreationDate
2009-03-06T21:51:40.460
FavoriteCount
0
LastActivityDate
2009-03-07T14:53:52.373
LastEditDate
2009-03-07T14:53:52.373
LastEditorUserId
68507
OwnerUserId
68507
ParentId
605480
PostTypeId
2
Score
3
ViewCount
0
LastEditorDisplayName
Ben S
text
Body
These "start-step-stop" codes looks like a different way of calling <a href="http://en.wikipedia.org/wiki/Huffman_coding" rel="nofollow noreferrer">Huffman codes</a>. See the <a href="http://en.wikipedia.org/wiki/Huffman_coding#Basic_technique" rel="nofollow noreferrer">basic technique</a> for an outline of the pseudo-code for calculating them. Essentially this is what the algorithm does: Before you start the Huffman encoding you need to gather the statistics of each symbol you'll be compressing (Their total frequency in the file to compress). After you have that you create a <a href="http://en.wikipedia.org/wiki/Binary_tree" rel="nofollow noreferrer">binary tree</a> using that info such that the most frequently used symbols are at the top of the tree (and thus use less bits) and such that no encoding has a <a href="http://en.wikipedia.org/wiki/Prefix_code" rel="nofollow noreferrer">prefix code</a>. Since if an encoding has a common prefix there could be ambiguities decompressing. At the end of the Huffman encoding your start value will be depth of the shallowest leaf node, your step will always be 1 (logically this makes sense, why would you force more bits than you need, just add one at a time,) and your stop value will be the depth of the deepest leaf node. If the frequency stats aren't sorted it will take O(nlog n) to do, if they are sorted by frequency it can be done in O(n). Huffman codes are guaranteed to have the best average compression for this type of encoding: <blockquote> Huffman was able to design the most efficient compression method of this type: no other mapping of individual source symbols to unique strings of bits will produce a smaller average output size when the actual symbol frequencies agree with those used to create the code. </blockquote> This should help you implement the ideal solution to your problem. Edit: Though similar, this isn't what the OP was looking for. This <a href="http://www.stevenpigeon.org/Publications/publications/ssc_full.pdf" rel="nofollow noreferrer">academic paper</a> by the creator of these codes describes a generalization of start-step-stop codes, start-stop codes. However, the author briefly describes how to get optimal start-step-stop near the end of section 2. It involves using a statistical random variable, or brute-force funding the best combination. Without any prior knowledge of the file the algorithm is O((log n)^3). Hope this helps.
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POHow can one compute the optimal parameters to a start-step-stop coding scheme?
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USBen S
UserOwnerUserId
1. USBen S
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POHow can one compute the optimal parameters to a start-step-stop coding scheme?
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.