StackOverflow2013

plurals

PO
primarykey
Id
13591759
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
3
CommunityOwnedDate
CreationDate
2012-11-27T19:39:00.383
FavoriteCount
0
LastActivityDate
2012-11-27T21:52:13.183
LastEditDate
2017-05-23T11:56:11.990
LastEditorUserId
-1
OwnerUserId
1695960
ParentId
13591237
PostTypeId
2
Score
1
ViewCount
0
LastEditorDisplayName
text
Body
One approach is to flatten your 2D array and handle it in a 1D fashion with pointer arithmetic to handle the row and column dimensions. First of all in your struct definition, replace the antenna_pattern element with:
<pre><code>struct LR {
  .
  .
  float *antenna_pattern;
} LR;
</code></pre>
Then you will need to do a host-side malloc to allocate space:
<pre><code>#define COL 1001
#define ROW 361
#define DSIZE (ROW*COL)

LR.antenna_pattern = (float *)malloc(DSIZE*sizeof(float));
</code></pre>
And a device side cuda malloc:
<pre><code>float *d_antenna_pattern;
cudaMalloc((void **) &d_antenna_pattern, DSIZE*sizeof(float));
</code></pre>
The copy to the device looks like:
<pre><code>cudaMemcpy(d_antenna_pattern, LR.antenna_pattern, DSIZE*sizeof(float), cudaMemcpyHostToDevice);
</code></pre>
When you want to reference into these arrays, you will have to do pointer arithmetic like:
<pre><code>float my_val_xy = ap[(x*COL)+y]; // to access element at [x][y] on the device
float my_val_xy = LR.antenna_pattern[(x*COL)+y]; // on the host
</code></pre>
If you want to maintain the 2D array subscripts throughout, you can do this with an appropriate typedef. For an example, see the first code sample in my answer to <a href="https://stackoverflow.com/questions/12924155/sending-3d-array-to-cuda-kernel/12925014#12925014">this question</a>. To diagram this out, you would need to start with a typedef:
<pre><code>#define COL 1001
#define ROW 361
#define DSIZE (ROW*COL)

typedef float aParray[COL];
</code></pre>
and modify your structure definition:
<pre><code>struct LR {
  .
  .
  aParray *antenna_pattern;
} LR;
</code></pre>
The host side malloc would look like:
<pre><code>LR.antenna_pattern = (aParray *)malloc(DSIZE*sizeof(float));
</code></pre>
The device side cuda malloc would look like:
<pre><code>aParray *d_antenna_pattern;
cudaMalloc((void **) &d_antenna_pattern, DSIZE*sizeof(float));
</code></pre>
The copy to the device looks like:
<pre><code>cudaMemcpy(d_antenna_pattern, LR.antenna_pattern, DSIZE*sizeof(float), cudaMemcpyHostToDevice);
</code></pre>
The device kernel definition will need a function parameter like:
<pre><code>__global__ void myKernel(float ap[][COL]) {
</code></pre>
Then inside the kernel you can access an element at x,y as:
<pre><code>float my_val_xy = ap[x][y];
</code></pre>
Now in response to a follow-up question asking what to do if LR cannot be changed, here is a complete sample code which combines some of these ideas without modifying the LR structure:
<pre><code>#include <stdio.h>

// for cuda error checking
#define cudaCheckErrors(msg) \
    do { \
        cudaError_t __err = cudaGetLastError(); \
        if (__err != cudaSuccess) { \
            fprintf(stderr, "Fatal error: %s (%s at %s:%d)\n", \
                msg, cudaGetErrorString(__err), \
                __FILE__, __LINE__); \
            fprintf(stderr, "*** FAILED - ABORTING\n"); \
            return 1; \
        } \
    } while (0)

struct LR {
  int foo;
  float antenna_pattern[361][1001];
} LR;

__global__ void mykernel(float ap[][1001]){
  int tid = threadIdx.x + (blockDim.x*blockIdx.x);
  float myval = 0.0;

  if (tid == 0){
    for (int i=0; i<361; i++)
      for (int j=0; j<1001; j++)
        ap[i][j] = myval++;
  }
}

int main(){
  typedef float aParray[1001];
  aParray *d_antenna_pattern;

  cudaMalloc((void **) &d_antenna_pattern, (361*1001)*sizeof(float));
  cudaCheckErrors("cudaMalloc fail");

  float *my_ap_ptr;
  my_ap_ptr = &(LR.antenna_pattern[0][0]);
  for (int i=0; i< 361; i++)
    for (int j=0; j<1001; j++)
      LR.antenna_pattern[i][j] = 0.0;

  cudaMemcpy(d_antenna_pattern, my_ap_ptr, (361*1001)*sizeof(float), cudaMemcpyHostToDevice);
  cudaCheckErrors("cudaMemcpy fail");

  mykernel<<<1,1>>>(d_antenna_pattern);
  cudaCheckErrors("Kernel fail");

  cudaMemcpy(my_ap_ptr, d_antenna_pattern, (361*1001)*sizeof(float), cudaMemcpyDeviceToHost);
  cudaCheckErrors("cudaMemcpy 2 fail");

  float myval = 0.0;
  for (int i=0; i<361; i++)
    for (int j=0; j<1001; j++)
      if (LR.antenna_pattern[i][j] != myval++) {
        printf("mismatch at offset x: %d y: %d actual: %f expected: %f\n", i, j, LR.antenna_pattern[i][j], --myval);
        return 1;
      }
  printf("Results match!\n");
  return 0;
}
</code></pre>
If you prefer to use the flattened method, replace the <code>d_antenna_pattern</code> definition with:
<pre><code>float *d_antenna_pattern;
</code></pre>
And change the kernel function parameter correspondingly to:
<pre><code>__global__ void mykernel(float *ap){
</code></pre>
Then access using the pointer arithmetic method in the kernel:
<pre><code>ap[(i*1001)+j] = myval++;
</code></pre>
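As a rough host-only sketch of why the two methods are interchangeable: the flattened offset <code>(x*COL)+y</code> and the typedef'd subscript <code>[x][y]</code> should land on the same element of a row-major buffer. The helper names <code>flat_index</code> and <code>views_agree</code> are hypothetical, introduced only for this illustration, and the code is plain C with no CUDA calls, so it checks the indexing only, not the device transfer.

```c
#include <stdlib.h>

#define COL 1001
#define ROW 361

/* One row of COL floats, matching the typedef used in the answer. */
typedef float aParray[COL];

/* Hypothetical helper: flattened offset of element [x][y] in a
   row-major ROW x COL array (the pointer arithmetic method). */
size_t flat_index(size_t x, size_t y) {
    return x * COL + y;
}

/* Hypothetical helper: writes through the 2D (typedef) view and reads
   back through the flat view of the same allocation. Returns 1 when
   both views address the same element, 0 otherwise (or if malloc fails). */
int views_agree(size_t x, size_t y, float value) {
    float *flat = (float *)malloc((size_t)ROW * COL * sizeof(float));
    if (flat == NULL) {
        return 0;
    }
    aParray *grid = (aParray *)flat;  /* same memory, 2D subscripts */
    grid[x][y] = value;
    int same = (flat[flat_index(x, y)] == value);
    free(flat);
    return same;
}
```

Under these assumptions, calling <code>views_agree(2, 3, 7.5f)</code> should return 1, since <code>flat_index(2, 3)</code> is 2*1001+3 = 2005, which is where <code>grid[2][3]</code> lives. This is also why the same <code>cudaMemcpy</code> size works for both the flattened and the typedef'd versions.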
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. PO How do I pass a multi-dimensional array by reference when it's a property of a struct?
 singulars
 PostTypePostTypeId
 PT Question
PostTypePostTypeId
1. PT Answer
UserLastEditorUserId
1. US Community
UserOwnerUserId
1. US Robert Crovella
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. PO How do I pass a multi-dimensional array by reference when it's a property of a struct?
 singulars
 PostTypePostTypeId
 PT Question
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VT AcceptedByOriginator
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VT UpMod
CommentsPostId