StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POFor nested loops with CUDA
primarykey
Id
9921873
data
AcceptedAnswerId
9922568
AnswerCount
1
ClosedDate
CommentCount
2
CommunityOwnedDate
CreationDate
2012-03-29T08:39:58.743
FavoriteCount
1
LastActivityDate
2012-03-29T12:23:58.447
LastEditDate
LastEditorUserId
0
OwnerUserId
1300224
ParentId
0
PostTypeId
1
Score
3
ViewCount
2091
LastEditorDisplayName
text
Body
I'm having a problem with some for nested loops that I have to convert from C/C++ into CUDA. Basically I have 4 for nested loops which are sharing the same array and making bit shift operations. <pre><code>#define N 65536 // ---------------------------------------------------------------------------------- int a1,a2,a3,a4, i1,i2,i3,i4; int Bit4CBitmapLookUp[16] = {0, 1, 3, 3, 7, 7, 7, 7, 15, 15, 15, 15, 15, 15, 15, 15}; int _cBitmapLookupTable[N]; int s = 0; // index into the cBitmapLookupTable for (i1 = 0; i1 < 16; i1++) { // first customer a1 = Bit4CBitmapLookUp[i1] << 12; for (i2 = 0; i2 < 16; i2++) { // second customer a2 = Bit4CBitmapLookUp[i2] << 8; for (i3 = 0; i3 < 16; i3++) { // third customer a3 = Bit4CBitmapLookUp[i3] << 4; for (i4 = 0;i4 < 16;i4++) { // fourth customer a4 = Bit4CBitmapLookUp[i4]; // now actually set the sBitmapLookupTable value _cBitmapLookupTable[s] = a1 | a2 | a3 | a4; s++; } // for i4 } // for i3 } // for i2 } // for i1 </code></pre> This is the code that I should convert into CUDA. I tried different ways but everytime i having the wrong output. Here i post my version of CUDA conversion (the piece from kernel's part) <pre><code>#define N 16 //---------------------------------------------------------------------------------- // index for the GPU int i1 = blockDim.x * blockIdx.x + threadIdx.x; int i2 = blockDim.y * blockIdx.y + threadIdx.y; int i3 = i1; int i4 = i2; __syncthreads(); for(i1 = i2 = 0; i1 < N, i2 < N; i1++, i2++) { // first customer a1 = Bit4CBitmapLookUp_device[i1] << 12; // second customer a2 = Bit4CBitmapLookUp_device[i2] << 8; for(i3 = i4 = 0; i3 < N, i4 < N; i3++, i4++){ // third customer a3 = Bit4CBitmapLookUp_device[i3] << 4; // fourth customer a4 = Bit4CBitmapLookUp_device[i4]; // now actually set the sBitmapLookupTable value _cBitmapLookupTable[s] = a1 | a2 | a3 | a4; s++; } } </code></pre> I'm brand new in CUDA and I'm still learning, but really i can't find a solution for those for nested loops. Thank you in advance.
Tags
<c++><c><for-loop><cuda><parallel-processing>
Title
For nested loops with CUDA
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USdavideberdin
plurals
PostLinksPostIdRelatedPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
3. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POFor nested loops with CUDA
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POFor nested loops with CUDA
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POFor nested loops with CUDA
 UserUserId
 USSoftware_Designer
 VoteTypeVoteTypeId
 VTFavorite
CommentsPostId
1. COHint: you're initializing the variables `i1`...`i4` to values that never get used.
 singulars
 PostPostId
 POFor nested loops with CUDA
 UserUserId
 USleftaroundabout
2. COSee this -> http://stackoverflow.com/questions/5306117/cuda-kernel-nested-for-loop http://stackoverflow.com/questions/6479715/nested-loops-to-cuda http://stackoverflow.com/questions/9527026/cumulative-sum-in-two-dimensions-on-array-in-nested-loop-cuda-implementation
 singulars
 PostPostId
 POFor nested loops with CUDA
 UserUserId
 This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.