Note that there are some explanatory texts on larger screens.

plurals
  1. POTimeout in CUDA? / fermi / gtx465
    primarykey
    data
    text
    <p>I am using CUDA SDK 3.1 on MS VS2005 with GPU GTX465 1 GB. I have such a kernel function:</p> <pre><code>__global__ void CRT_GPU_2(float *A, float *X, float *Y, float *Z, float *pIntensity, float *firstTime, float *pointsNumber) { int holo_x = blockIdx.x*20 + threadIdx.x; int holo_y = blockIdx.y*20 + threadIdx.y; float k=2.0f*3.14f/0.000000054f; if (firstTime[0]==1.0f) { pIntensity[holo_x+holo_y*MAX_FINAL_X]=0.0f; } for (int i=0; i&lt;pointsNumber[0]; i++) { pIntensity[holo_x+holo_y*MAX_FINAL_X]=pIntensity[holo_x+holo_y*MAX_FINAL_X]+A[i]*cosf(k*sqrtf(pow(holo_x-X[i],2.0f)+pow(holo_y-Y[i],2.0f)+pow(Z[i],2.0f))); } __syncthreads(); } </code></pre> <p>and this is function which calls kernel function:</p> <pre><code>extern "C" void go2(float *pDATA, float *X, float *Y, float *Z, float *pIntensity, float *firstTime, float *pointsNumber) { dim3 blockGridRows(MAX_FINAL_X/20,MAX_FINAL_Y/20); dim3 threadBlockRows(20, 20); CRT_GPU_2&lt;&lt;&lt;blockGridRows, threadBlockRows&gt;&gt;&gt;(pDATA, X, Y, Z, pIntensity,firstTime, pointsNumber); CUT_CHECK_ERROR("multiplyNumbersGPU() execution failed\n"); CUDA_SAFE_CALL( cudaThreadSynchronize() ); } </code></pre> <p>I am loading in loop all the paramteres to this function (for example 4096 elements for each parameter in one loop iteration). In total I want to make this kernel for 32768 elements for each parameter after all loop iterations. </p> <p>The MAX_FINAL_X is 1920 and MAX_FINAL_Y is 1080.</p> <p>When I am starting alghoritm first iteration goes very fast and after one or two iteration more I get information about CUDA timeout error. I used this alghoritm on GPU gtx260 and it was doing better as far as I remember...</p> <p>Could You help me.. maybe I am doing some mistake according to new Fermi arch in this algorithm? </p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload