StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POPython: fast dictionary of big int keys
primarykey
Id
5911191
data
AcceptedAnswerId
6441338
AnswerCount
4
ClosedDate
CommentCount
4
CommunityOwnedDate
CreationDate
2011-05-06T12:09:38.820
FavoriteCount
0
LastActivityDate
2011-06-22T14:12:46.510
LastEditDate
2011-05-06T19:48:05.753
LastEditorUserId
52023
OwnerUserId
52023
ParentId
0
PostTypeId
1
Score
3
ViewCount
1362
LastEditorDisplayName
text
Body
I have got a list of >10.000 int items. The values of the items can be very high, up to 10^27. Now I want to create all pairs of the items and calculate their sum. Then I want to look for different pairs with the same sum. For example: <pre><code>l[0] = 4 l[1] = 3 l[2] = 6 l[3] = 1 ... pairs[10] = [(0,2)] # 10 is the sum of the values of l[0] and l[2] pairs[7] = [(0,1), (2,3)] # 7 is the sum of the values of l[0] and l[1] or l[2] and l[3] pairs[5] = [(0,3)] pairs[9] = [(1,2)] ... </code></pre> The contents of <code>pairs[7]</code> is what I am looking for. It gives me two pairs with the same value sum. I have implemented it as follows - and I wonder if it can be done faster. Currently, for 10.000 items it takes >6 hours on a fast machine. (As I said, the values of <code>l</code> and so the keys of <code>pairs</code> are ints up to 10^27.) <pre><code>l = [4,3,6,1] pairs = {} for i in range( len( l ) ): for j in range(i+1, len( l ) ): s = l[i] + l[j] if not s in pairs: pairs[s] = [] pairs[s].append((i,j)) # pairs = {9: [(1, 2)], 10: [(0, 2)], 4: [(1, 3)], 5: [(0, 3)], 7: [(0, 1), (2, 3)]} </code></pre> <hr> Edit: I want to add some background, as asked by Simon Stelling. The goal is to find Formal Analogies like <pre><code>lays : laid :: says : said </code></pre> within a list of words like <pre><code>[ lays, lay, laid, says, said, foo, bar ... ] </code></pre> I already have a function <code>analogy(a,b,c,d)</code> giving <code>True</code> if <code>a : b :: c : d</code>. However, I would need to check all possible quadruples created from the list, which would be a complexity of around O((n^4)/2). As a pre-filter, I want to use the char-count property. It says that every char has the same count in (a,d) and in (b,c). For instance, in "layssaid" we have got 2 a's, and so we do in "laidsays" So the idea until now was <ul> <li>for every word to create a "char count vector" and represent it as an integer (the items in the list <code>l</code>)</li> <li>create all pairings in <code>pairs</code> and see if there are "pair clusters", i.e. more than one pair for a particular char count vector sum.</li> </ul> And it works, it's just slow. The complexity is down to around O((n^2)/2) but this is still a lot, and especially the dictionary lookup and insert is done that often.
Tags
<python><optimization><dictionary><int><biginteger>
Title
Python: fast dictionary of big int keys
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USGeorg Jähnig
UserOwnerUserId
1. USGeorg Jähnig
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
3. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POPython: fast dictionary of big int keys
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POPython: fast dictionary of big int keys
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POPython: fast dictionary of big int keys
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.