GPGPU and CUDA

GPGPU and CUDA Center for Visual Information Technology, IIIT Hyderabad

Singular Value Decomposition on GPU Sheetal Lahabar • 2 Step SVD bidiagonalization on GPU and hybrid CPU/GPU implementation for Diagonalization • Error due to lower precision < 0.001% • Upto 8x faster than Intel MKL • Upto 59x faster than MATLAB

Results on SVD

Artificial Neural Networks on GPU Sheetal Lahabar • ANN training and classification, batch learning formulated in CUBLAS • Average classification time for 1K test pattern is 0.759 sec • Upto 210 times faster than MATLAB • Upto 267 times faster than FANN

ANN on GPU results

Thank you

GPGPU and CUDA

GPGPU and CUDA

Presentation Transcript

GPGPU: CUDA vs OpenGL

GPGPU Programming

Cuda

CUDA

CUDA

CUDA-NP: Realizing Nested Thread-Level Parallelism in GPGPU Applications

Intermediate GPGPU Programming in CUDA

CUDA Lecture 7 CUDA Threads and Atomics

CUDA

GPGPU overview

GPGPU introduction

CUDA

CUDA

GPGPU

GPGPU Programming

GPGPU Programming