
GPU technology is evolving rapidly. If you are an HPC developer moving to the CUDA development environment, you can debug your applications with TotalView for CUDA.
TotalView on Linux-x86-64 supports CUDA debugging:
- CUDA 4.0, 4.1 and 4.2
- Debugging host and device code in the same session
- CUDA running directly on Tesla or Fermi hardware
- Linux and GPU device thread visibility
- Full visibility to the hierarchical device, block, and thread memory
- Navigating device threads by logical and device coordinates
- CUDA function calls, host pinned memory regions and CUDA contexts
- Handling CUDA functions inline and on the stack
- Command line interface (CLI) commands for CUDA functions
- Applications that use multiple NVIDIA devices at the same time
- MPI applications on CUDA-accelerated clusters
- Unified Virtual Addressing and GPUDirect
- CUDA C++ and inline PTX
- Reporting memory errors and handling CUDA exceptions
>> Read the latest press release about TotalView's support for CUDA
>> Download GPU Debugging: Rogue Wave's TotalView on NVIDIA CUDA
>> Access PDF of the Debugging CUDA with TotalView Presentation