Home > Software engineering >  How to apply cuda-memcheck to an app with piped inputs from standard I/O
How to apply cuda-memcheck to an app with piped inputs from standard I/O

Time:06-04

I want to use cuda-memcheck for an app with standard I/O.

The app, dut, reads standard Input and writes standard Output.

cat input.txt | cuda-memcheck ./dut -dutoptions > output.txt

In this case, the dut app seems to be launched, but cuda-memcheck doesn’t work well.

How can I do that?

CodePudding user response:

One possible method:

Place your command to be executed into a script. Run cuda-memcheck on that script.

$ cat input.txt
4

$ cat t2030.cu
#include <iostream>

__global__ void k(int *d){
  *d = 5;
}

int main(){

  int *d = NULL;
#ifndef FAULT
  cudaMalloc(&d, sizeof(d[0]));
#endif
  int a;
  std::cin  >> a;
  std::cout << "value read: " << a << std::endl;
  k<<<1,1>>>(d);
  cudaDeviceSynchronize();
}

$ nvcc -o t2030 t2030.cu
$ cat test
cat input.txt | ./t2030 > output.txt
$ chmod  x test
$ cuda-memcheck ./test
========= CUDA-MEMCHECK
========= ERROR SUMMARY: 0 errors
$ cat output.txt
value read: 4
$ nvcc -DFAULT -o t2030 t2030.cu
$ cuda-memcheck ./test
========= CUDA-MEMCHECK
========= Invalid __global__ write of size 4
=========     at 0x00000050 in k(int*)
=========     by thread (0,0,0) in block (0,0,0)
=========     Address 0x00000000 is out of bounds
=========     Device Frame:k(int*) (k(int*) : 0x50)
=========     Saved host backtrace up to driver entry point at kernel launch time
=========     Host Frame:/lib64/libcuda.so.1 [0x20d6ea]
=========     Host Frame:./t2030 [0x864b]
=========     Host Frame:./t2030 [0x5ec88]
=========     Host Frame:./t2030 [0x3ea0]
=========     Host Frame:./t2030 [0x3d81]
=========     Host Frame:./t2030 [0x3da9]
=========     Host Frame:./t2030 [0x3c2a]
=========     Host Frame:/lib64/libc.so.6 (__libc_start_main   0xf5) [0x21b15]
=========     Host Frame:./t2030 [0x3a91]
=========
========= Program hit cudaErrorLaunchFailure (error 719) due to "unspecified launch failure" on CUDA API call to cudaDeviceSynchronize.
=========     Saved host backtrace up to driver entry point at error
=========     Host Frame:/lib64/libcuda.so.1 [0x3bd253]
=========     Host Frame:./t2030 [0x3a047]
=========     Host Frame:./t2030 [0x3c2f]
=========     Host Frame:/lib64/libc.so.6 (__libc_start_main   0xf5) [0x21b15]
=========     Host Frame:./t2030 [0x3a91]
=========
========= ERROR SUMMARY: 2 errors
$

note that for a volta or newer device, cuda-memcheck should be replaced by compute-sanitizer.

  • Related