StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
2982965
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
5
CommunityOwnedDate
CreationDate
2010-06-06T03:06:13.140
FavoriteCount
0
LastActivityDate
2010-06-06T03:53:53.030
LastEditDate
2010-06-06T03:53:53.030
LastEditorUserId
303180
OwnerUserId
303180
ParentId
2982957
PostTypeId
2
Score
4
ViewCount
0
LastEditorDisplayName
text
Body
That's referring to "variable integer encoding", where the number of bits used to store an integer when serialized is not fixed at 4 bytes. There is a good description of <a href="http://code.google.com/apis/protocolbuffers/docs/encoding.html#varints" rel="nofollow noreferrer">varint in the protocol buffer documentation</a>. It is used in encoding <a href="http://code.google.com/apis/protocolbuffers/" rel="nofollow noreferrer">Google's protocol buffers</a>, and you can browse the <a href="http://www.google.com/codesearch/p?hl=en#WTeibokF6gE/trunk/src/google/protobuf/wire_format.h&q=varint%20package:http://protobuf%5C.googlecode%5C.com&d=4" rel="nofollow noreferrer">protocol buffer source code</a>. The <code>CodedOutputStream</code> contains the exact encoding function <a href="http://www.google.com/codesearch/p?hl=en#WTeibokF6gE/trunk/src/google/protobuf/io/coded_stream.cc&q=varint%20package:http://protobuf%5C.googlecode%5C.com&d=4&l=632" rel="nofollow noreferrer">WriteVarint32FallbackToArrayInline</a>: <pre><code>inline uint8* CodedOutputStream::WriteVarint32FallbackToArrayInline( uint32 value, uint8* target) { target[0] = static_cast<uint8>(value | 0x80); if (value >= (1 << 7)) { target[1] = static_cast<uint8>((value >> 7) | 0x80); if (value >= (1 << 14)) { target[2] = static_cast<uint8>((value >> 14) | 0x80); if (value >= (1 << 21)) { target[3] = static_cast<uint8>((value >> 21) | 0x80); if (value >= (1 << 28)) { target[4] = static_cast<uint8>(value >> 28); return target + 5; } else { target[3] &= 0x7F; return target + 4; } } else { target[2] &= 0x7F; return target + 3; } } else { target[1] &= 0x7F; return target + 2; } } else { target[0] &= 0x7F; return target + 1; } } </code></pre> The cascading <code>if</code>s will only add additional bytes onto the end of the <code>target</code> array if the magnitude of <code>value</code> warrants those extra bytes. The <code>0x80</code> masks the byte being written, and the <code>value</code> is shifted down. From what I can tell, the <code>0x7f</code> mask causes it to signify the "last byte of encoding". (When OR'ing <code>0x80</code>, the highest bit will always be <code>1</code>, then the last byte clears the highest bit (by AND'ing <code>0x7f</code>). So, when reading varints you read until you get a byte with a zero in the highest bit. I just realized you asked about "Group VarInt encoding" specifically. Sorry, that code was about basic VarInt encoding (still faster than 7-bit). The basic idea looks to be similar. Unfortunately, it's not what's being used to store 64bit numbers in protocol buffers. I wouldn't be surprised if that code was open sourced somewhere though. Using the ideas from <code>varint</code> and the diagrams of "Group varint" from the slides, it shouldn't be too too hard to cook up your own :) Here is another page describing <a href="http://www.ir.uwaterloo.ca/book/addenda-06-index-compression.html" rel="nofollow noreferrer">Group VarInt compression</a>, which contains decoding code. Unfortunately they allude to publicly available implementations, but they don't provide references. <pre><code>void DecodeGroupVarInt(const byte* compressed, int size, uint32_t* uncompressed) { const uint32_t MASK[4] = { 0xFF, 0xFFFF, 0xFFFFFF, 0xFFFFFFFF }; const byte* limit = compressed + size; uint32_t current_value = 0; while (compressed != limit) { const uint32_t selector = *compressed++; const uint32_t selector1 = (selector & 3); current_value += *((uint32_t*)(compressed)) & MASK[selector1]; *uncompressed++ = current_value; compressed += selector1 + 1; const uint32_t selector2 = ((selector >> 2) & 3); current_value += *((uint32_t*)(compressed)) & MASK[selector2]; *uncompressed++ = current_value; compressed += selector2 + 1; const uint32_t selector3 = ((selector >> 4) & 3); current_value += *((uint32_t*)(compressed)) & MASK[selector3]; *uncompressed++ = current_value; compressed += selector3 + 1; const uint32_t selector4 = (selector >> 6); current_value += *((uint32_t*)(compressed)) & MASK[selector4]; *uncompressed++ = current_value; compressed += selector4 + 1; } } </code></pre>
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POLooking for more details about "Group varint encoding/decoding" presented in Jeff's slides
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USStephen
UserOwnerUserId
1. USStephen
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POLooking for more details about "Group varint encoding/decoding" presented in Jeff's slides
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.