StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
4320422
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
3
CommunityOwnedDate
CreationDate
2010-12-01T01:23:19.173
FavoriteCount
0
LastActivityDate
2015-04-07T10:35:59.340
LastEditDate
2015-04-07T10:35:59.340
LastEditorUserId
579132
OwnerUserId
202699
ParentId
4317551
PostTypeId
2
Score
9
ViewCount
0
LastEditorDisplayName
text
Body
(1) Nested parallelism in OpenMP: <a href="http://docs.oracle.com/cd/E19205-01/819-5270/aewbc/index.html" rel="nofollow noreferrer">http://docs.oracle.com/cd/E19205-01/819-5270/aewbc/index.html</a> You need to turn on nested parallelism by setting <code>OMP_NESTED</code> or <code>omp_set_nested</code> because many implementations turn off this feature by default, even some implementations didn't support nested parallelism fully. If turned on, whenever you meet <code>parallel for</code>, OpenMP will create the number of threads as defined in <code>OMP_NUM_THREADS</code>. So, if 2-level parallelism, the total number of threads would be N^2, where N = <code>OMP_NUM_THREADS</code>. Such nested parallelism will cause oversubscription, (i.e., the number of busy threads is greater than the cores), which may degrade the speedup. In an extreme case, where nested parallelism is called recursively, threads could be bloated (e.g., creating 1000s threads), and computer just wastes time for context switching. In such case, you may control the number of threads dynamically by setting <code>omp_set_dynamic</code>. (2) An example of matrix-vector multiplication: the code would look like: <pre><code>// Input: A(N by M), B(M by 1) // Output: C(N by 1) for (int i = 0; i < N; ++i) for (int j = 0; j < M; ++j) C[i] += A[i][j] * B[j]; </code></pre> In general, parallelizing inner loops while outer loops are possible is bad because of forking/joining overhead of threads. (though many OpenMP implementations pre-create threads, it still requires some to dispatch tasks to threads and to call implicit barrier at the end of parallel-for) Your concern is the case of where N < # of CPU. Yes, right, in this case, the speedup would be limited by N, and letting nested parallelism will definitely have benefits. However, then the code would cause oversubscription if N is sufficiently large. I'm just thinking the following solutions: <ul> <li>Changing the loop structure so that only 1-level loop exists. (It looks doable)</li> <li>Specializing the code: if N is small, then do nested parallelism, otherwise don't do that.</li> <li>Nested parallelism with <code>omp_set_dynamic</code>. But, please make it sure how <code>omp_set_dynamic</code> controls the number of threads and the activity of threads. Implementations may vary.</li> </ul>
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POOpenMP: What is the benefit of nesting parallelizations?
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USConor Taylor
UserOwnerUserId
1. USminjang
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POOpenMP: What is the benefit of nesting parallelizations?
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.