Hi,
I’m trying to use cuda-memcheck.exe to debug one of our CUDA programs, but I get nonsensical behavior (basically just unknown errors, with no kernel names).
I then tried using cuda-memcheck.exe on one of the CUDA SDK examples, the “Matrix Multiplication (CUDA Runtime API Version)” sample (from http://developer.nvidia.com/cuda-cc-sdk-code-samples). When I run matrixMul.exe through cuda-memcheck.exe, the multiplications fail (several thousand errors). It runs fine without cuda-memcheck.exe.
I also notice that the display is scrambled with random pixels during the kernel launch (a monitor is attached to the C2075, which is running in TCC mode). I tried removing the monitor and running through a VNC client, but the matrixMul example still fails.
I’m running Win7 Pro 64-bit with the following driver:
+------------------------------------------------------+
| NVIDIA-SMI 3.296.70   Driver Version: 296.70         |
|-------------------------------+----------------------+----------------------+
| Nb.  Name            TCC/WDDM | Bus Id        Disp.  | Volatile ECC SB / DB |
| Fan   Temp   Power Usage /Cap | Memory Usage         | GPU Util. Compute M. |
|===============================+======================+======================|
| 0.  Tesla C2075        TCC    | 0000:01:00.0  On     |         0          0 |
|  30%   66 C  P12   36W / 225W |   0%    9MB / 5375MB |    0%     Default    |
|-------------------------------+----------------------+----------------------|
| Compute processes:                                               GPU Memory |
|  GPU  PID     Process name                                       Usage      |
|=============================================================================|
|  No running compute processes found                                         |
+-----------------------------------------------------------------------------+

cuda-memcheck.exe is from the CUDA 4.2.9 toolkit.
Can somebody help me here?