CUDA Debugging

Screenshot of TotalView GPU technology is evolving rapidly.  If you are an HPC developer moving to the CUDA development environment, you can debug your applications with TotalView for CUDA. 

TotalView on Linux-x86-64 supports CUDA debugging:
  • CUDA 4.0, 4.1 and 4.2
  • Debugging host and device code in the same session
  • CUDA running directly on Tesla or Fermi hardware
  • Linux and GPU device thread visibility
  • Full visibility to the hierarchical device, block, and thread memory
  • Navigating device threads by logical and device coordinates
  • CUDA function calls, host pinned memory regions and CUDA contexts
  • Handling CUDA functions inline and on the stack
  • Command line interface (CLI) commands for CUDA functions
  • Applications that use multiple NVIDIA devices at the same time
  • MPI applications on CUDA-accelerated clusters
  • Unified Virtual Addressing and GPUDirect
  • CUDA C++ and inline PTX
  • Reporting memory errors and handling CUDA exceptions
>> Read the latest press release about TotalView's support for CUDA
>> Download GPU Debugging: Rogue Wave's TotalView on NVIDIA CUDA
>> Access PDF of the Debugging CUDA with TotalView Presentation