
Limiting concurrency and rate for Python threads

Given a number of threads, I want to limit the rate of calls to the worker function to, say, one per second.

My idea was to keep track of the last time a call was made across all threads and compare this to the current time in each thread. Then, if `current_time - last_time < rate`, I let the thread sleep for a bit. Something is wrong with my implementation; I presume I may have gotten the wrong idea about how locks work.

My code:

```python
from Queue import Queue
from threading import Thread, Lock
import time

num_worker_threads = 2
rate = 1

q = Queue()
lock = Lock()
last_time = [time.time()]

def do_work(i, idx):
    # Do work here; print is just a dummy.
    print('Thread: {0}, Item: {1}, Time: {2}'.format(i, idx, time.time()))

def worker(i):
    while True:
        # Serialize the timestamp check so only one thread at a time
        # decides whether it needs to wait.
        lock.acquire()
        current_time = time.time()
        interval = current_time - last_time[0]
        last_time[0] = current_time
        if interval < rate:
            time.sleep(rate - interval)
        lock.release()
        item = q.get()
        do_work(i, item)
        q.task_done()

for i in range(num_worker_threads):
    t = Thread(target=worker, args=[i])
    t.daemon = True
    t.start()

for item in xrange(10):
    q.put(item)

q.join()
```

I was expecting to see one call per second to `do_work`; however, I mostly get two calls at the same time (one from each thread), followed by a one-second pause. What is wrong?

---

Edit: The advice to simply throttle the rate at which items are put into the queue was good; however, I remembered that I had to take care of the case in which items are re-added to the queue by the workers. Canonical examples: pagination, or back-off-and-retry in network tasks. I came up with the following. I guess that for actual network tasks the eventlet/gevent libraries may be easier on resources, but this is just an example. It basically uses a priority queue to pile up the requests and uses an extra thread to shovel items from the pile to the actual task queue at an even rate. I simulated re-insertion into the pile by the workers; re-inserted items are then treated first.

```python
import time
import random
from Queue import Queue, PriorityQueue
from threading import Thread

rate = 0.1

def worker(q, q_pile, idx):
    while True:
        item = q.get()
        print("Thread: {0} processed: {1}".format(idx, item[1]))
        # Simulate work that sometimes has to be retried:
        # put the item back on the pile with a fresh priority.
        if random.random() > 0.3:
            print("Thread: {1} reinserting item: {0}".format(item[1], idx))
            q_pile.put((-1 * time.time(), item[1]))
        q.task_done()

def schedule(q_pile, q):
    # Shovel items from the pile to the task queue at an even rate.
    while True:
        if not q_pile.empty():
            print("Items on pile: {0}".format(q_pile.qsize()))
            q.put(q_pile.get())
            q_pile.task_done()
        time.sleep(rate)

def main():
    q_pile = PriorityQueue()
    q = Queue()

    for i in range(5):
        t = Thread(target=worker, args=[q, q_pile, i])
        t.daemon = True
        t.start()

    t_schedule = Thread(target=schedule, args=[q_pile, q])
    t_schedule.daemon = True
    t_schedule.start()

    [q_pile.put((-1 * time.time(), i)) for i in range(10)]

    q_pile.join()
    q.join()

if __name__ == '__main__':
    main()
```
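For reference, one plausible reading of the paired output: `last_time` records the moment the lock was acquired, not the moment the sleep finished, so the thread that just waited never pushes the timestamp forward, and the next thread computes an interval of almost exactly `rate` and sleeps for nearly zero. A minimal sketch of the same shared-timestamp idea with the update moved to after the sleep; only `worker` changes, the rest of the first snippet is assumed unchanged:

```python
def worker(i):
    while True:
        lock.acquire()
        # How long since the last dispatch?
        interval = time.time() - last_time[0]
        if interval < rate:
            time.sleep(rate - interval)
        # Record the dispatch time *after* any sleep, so the next
        # thread measures its wait from this dispatch.
        last_time[0] = time.time()
        lock.release()
        item = q.get()
        do_work(i, item)
        q.task_done()
```

Holding the lock across the sleep is deliberate in this sketch: it serializes the dispatches and guarantees they are spaced at least `rate` apart, at the cost of blocking the other threads while one waits.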
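The producer-side throttling mentioned at the start of the edit is worth spelling out, since it removes the shared lock entirely: workers just consume, and the single producer enforces the rate. A minimal self-contained sketch, mirroring the names of the first snippet:

```python
from Queue import Queue
from threading import Thread
import time

num_worker_threads = 2
rate = 1
q = Queue()

def do_work(i, idx):
    print('Thread: {0}, Item: {1}, Time: {2}'.format(i, idx, time.time()))

def worker(i):
    while True:
        item = q.get()
        do_work(i, item)
        q.task_done()

for i in range(num_worker_threads):
    t = Thread(target=worker, args=[i])
    t.daemon = True
    t.start()

# The single producer enforces the rate: one item per `rate` seconds
# enters the queue, so the workers need no locking at all.
for item in xrange(10):
    q.put(item)
    time.sleep(rate)

q.join()
```

Note that this only bounds how fast items enter the queue; if a worker stalls, the backlog can later drain faster than `rate` unless the queue is bounded, e.g. `Queue(maxsize=1)`.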
 

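One detail of the second snippet worth illustrating is why re-inserted items are treated first: `PriorityQueue` pops the smallest tuple, and `-1 * time.time()` shrinks as time passes, so the most recent insertion wins. A tiny sketch:

```python
from Queue import PriorityQueue
import time

q_pile = PriorityQueue()
q_pile.put((-1 * time.time(), 'original item'))
time.sleep(0.01)
q_pile.put((-1 * time.time(), 'reinserted item'))

# The later put has a larger timestamp and therefore a more negative
# priority; PriorityQueue pops the smallest tuple first.
print(q_pile.get()[1])  # -> reinserted item
print(q_pile.get()[1])  # -> original item
```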