ACADEMIA
NVIDIA Releases Industry's First OpenCL Performance Profiler for the GPU
New OpenCL Visual Profiler for Windows and Linux Now Available to Thousands of Developers
Leveraging the extensive performance instrumentation in NVIDIA's OpenCL drivers and hardware performance signals designed into NVIDIA GPUs, the OpenCL Visual Profiler provides developers with insight into performance bottlenecks and opportunities for optimization.
Key features include:
- Profiling of actual hardware signals, kernel efficiency, and instruction issue rate
- Timing of memory copies between system memory and GPU dedicated memory
- Customizable graphs to help developers focus in on problem areas
- Basic auto-analysis to reveal warp serialization problems
- Easy import/export to CSV for custom analysis
Chapters on the following topics and more are included in the guide:
- GPU Computing with OpenCL
- Performance Metrics
- Memory Optimizations
- NDRange Optimizations
- Instruction Optimizations
- Control Flow
- Performance Optimization Strategies
Professional developers and researchers are invited to apply for the program at: http://developer.nvidia.com/page/registered_developer_program.html