Problems doing shared memory test

philprattszeliga · February 17, 2008, 6:27pm

Hello,

I am trying to run the following code:

    __global__ void testKernel( float* g_idata, int data_size, unsigned int block_size)
    {
      // access thread id
      const unsigned int tid = (blockIdx.x * blockDim.x) + threadIdx.x;
      // access number of threads in this block
      //const unsigned int num_threads = blockDim.x;
      const unsigned int num_threads = block_size;
      int slices = data_size / num_threads;
      int js;

      __shared__ float shared_data[2048];

      for(int i = 0; i < 2048; i++){
        shared_data[i] = g_idata[i];
      }
      __syncthreads();

      for(int i = 0; i < slices; i++){
        js = (tid*slices) + i;
        for(int j = 0; j < 200000; j++){
          //g_idata[js] = g_idata[js] + 2;
          //g_idata[js] = g_idata[js] - 1;
          shared_data[js] = shared_data[js] + 2;
          shared_data[js] = shared_data[js] - 1;
        }
      }

      for(int i = 0; i < 2048; i++){
        g_idata[i] = shared_data[i];
      }
    }

My previous version, which operated on g_idata directly, worked fine, but now with shared memory only about the last half of the array gets set to the right values. What am I doing wrong?

AndreiB · February 17, 2008, 7:28pm

I think you are using shared memory incorrectly.
When you declare __shared__ float shared_data[2048]; those 2048 floats are shared among the threads of a single block; each thread does not get its own 2048 floats from the shared memory pool.
So when you copy data from global to shared memory, all of your threads read the same data from global memory and write it to the same shared memory addresses.
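
To illustrate the point about block scope, here is a minimal sketch (not the original poster's code) in which each block copies only its own tile of the global array into its private copy of the shared array, one element per thread. The names addOneTiled, runTiledTest and TILE are made up for this example, and the launch is assumed to use exactly TILE threads per block.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical tile size: one element per thread, so launch with TILE threads per block.
constexpr int TILE = 256;

__global__ void addOneTiled(float* g_data, int n) {
    __shared__ float tile[TILE];                       // every block gets its own copy
    int gid = blockIdx.x * blockDim.x + threadIdx.x;   // index into global memory
    int lid = threadIdx.x;                             // index into this block's tile

    if (gid < n) tile[lid] = g_data[gid];  // each thread loads exactly one element
    __syncthreads();                       // outside the branch, so every thread reaches it

    if (gid < n) {
        tile[lid] += 1.0f;
        g_data[gid] = tile[lid];  // same thread wrote tile[lid], so no second barrier here
    }
}

// Hypothetical host helper: returns the number of wrong elements, or -1 on a CUDA error.
int runTiledTest(int n) {
    std::vector<float> h(n);
    for (int i = 0; i < n; ++i) h[i] = float(i);

    float* d = nullptr;
    size_t bytes = n * sizeof(float);
    if (cudaMalloc(&d, bytes) != cudaSuccess) return -1;
    cudaMemcpy(d, h.data(), bytes, cudaMemcpyHostToDevice);

    int blocks = (n + TILE - 1) / TILE;   // round up so a partial last tile is covered
    addOneTiled<<<blocks, TILE>>>(d, n);

    cudaError_t err = cudaMemcpy(h.data(), d, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d);
    if (err != cudaSuccess) return -1;

    int bad = 0;
    for (int i = 0; i < n; ++i)
        if (h[i] != float(i) + 1.0f) ++bad;
    return bad;
}
```

Calling runTiledTest(2048) from a host program should return 0 if every element was incremented once. The key difference from the kernel in the question is that the shared array is indexed with threadIdx.x, while only the global array is indexed with the block-wide thread id.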

It would also be a good idea to insert __syncthreads() after you update shared memory, i.e. after the inner loop, because without it you may get incorrect results.
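
As a rough sketch of how the kernel from the question might look with both suggestions applied, the version below spreads the copy loops across the threads of the block and places a barrier after the update and before the copy back. It rests on assumptions not stated in the thread: the kernel is launched with a single block, and the thread count divides 2048 evenly. The names testKernelFixed, runFixedTest, N_SHARED and iters are invented for illustration.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int N_SHARED = 2048;   // same array size as in the question

// Sketch only: assumes a launch with ONE block and N_SHARED % blockDim.x == 0.
__global__ void testKernelFixed(float* g_idata, int iters) {
    __shared__ float shared_data[N_SHARED];
    const unsigned int t = threadIdx.x;          // index within the block
    const unsigned int nthreads = blockDim.x;
    const int slices = N_SHARED / nthreads;      // elements updated by each thread

    // Cooperative load: each thread copies a strided subset instead of all 2048 values.
    for (int i = t; i < N_SHARED; i += nthreads)
        shared_data[i] = g_idata[i];
    __syncthreads();   // the whole array is loaded before anyone updates it

    for (int i = 0; i < slices; ++i) {
        int js = t * slices + i;                 // this thread's contiguous slice
        for (int j = 0; j < iters; ++j) {
            shared_data[js] = shared_data[js] + 2;
            shared_data[js] = shared_data[js] - 1;
        }
    }
    __syncthreads();   // all updates are finished before any thread copies back

    // Cooperative store: a thread may now read elements that other threads updated.
    for (int i = t; i < N_SHARED; i += nthreads)
        g_idata[i] = shared_data[i];
}

// Hypothetical host helper: returns the number of wrong elements, or -1 on a CUDA error.
int runFixedTest(int threads, int iters) {
    std::vector<float> h(N_SHARED);
    for (int i = 0; i < N_SHARED; ++i) h[i] = float(i);

    float* d = nullptr;
    size_t bytes = N_SHARED * sizeof(float);
    if (cudaMalloc(&d, bytes) != cudaSuccess) return -1;
    cudaMemcpy(d, h.data(), bytes, cudaMemcpyHostToDevice);

    testKernelFixed<<<1, threads>>>(d, iters);   // single block, as assumed above

    cudaError_t err = cudaMemcpy(h.data(), d, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d);
    if (err != cudaSuccess) return -1;

    int bad = 0;
    for (int i = 0; i < N_SHARED; ++i)
        if (h[i] != float(i + iters)) ++bad;     // each iteration adds a net +1
    return bad;
}
```

For example, runFixedTest(256, 100) should return 0 when every element ends up 100 larger than it started. With more than one block, each block would hold its own copy of shared_data and the final copy loops would overwrite each other's results in global memory, which is one plausible explanation for only part of the array coming out right.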
