160 likes | 635 Views
Blocked 2D Convolution. Ravi Sankar P Nair 010469036. Implement 2D Convolution. Source: http://www.songho.ca/dsp/convolution/convolution2d_example.html. Implement 2D Convolution.cpp in GPU Kernel. Implement 2D Convolution.cpp in GPU Kernel. Use Constant memory to store M matrix.
E N D
Blocked 2D Convolution Ravi Sankar P Nair 010469036
Implement 2D Convolution Source: http://www.songho.ca/dsp/convolution/convolution2d_example.html
Implement 2D Convolution.cpp in GPU Kernel Use Constant memory to store M matrix
Implement 2D Convolution.cpp in GPU Kernel Use Constant memory to store M matrix
Performance Testing CPU vs. GPU What is the measured floating-point computation rate for the CPU and GPU kernels on this application? How do they each scale with the size of the input? #include <sys/time.h>
Performance Testing CPU vs. GPU What is the measured floating-point computation rate for the CPU and GPU kernels on this application? How do they each scale with the size of the input? Alternate Timer method
Performance Testing CPU vs. GPU What is the measured floating-point computation rate for the CPU and GPU kernels on this application? How do they each scale with the size of the input? #include <sys/time.h>
Performance Testing CPU vs. GPU 2. How much time is spent as an overhead cost of using the GPU for computation? Consider all code executed within your host function, with the exception of the kernel itself, as overhead. How does the overhead scale with the size of the input?
Performance Testing CPU vs. GPU Table shows values in micro seconds. Run on GTX 480 pacman.ddns.uark.edu Total Setup = Setup M,N + Setup GPU call Over Head GPU = Setup GPU Call – GPU kernel Over Head Setup = Total Setup – GPU kernel Over Head Main = Total Main program – GPU Kernel
Performance Testing CPU vs. GPU Table shows values in micro seconds. Run on GTX 480 pacman.ddns.uark.edu (Alternate Timer) Total Setup = Setup M,N + Setup GPU call Over Head GPU = Setup GPU Call – GPU kernel Over Head Setup = Total Setup – GPU kernel Over Head Main = Total Main program – GPU Kernel
Performance Testing CPU vs. GPU Run on GTX 480 pacman.ddns.uark.edu
Performance Testing CPU vs. GPU Table shows values in micro seconds. Run on GTX 295 stargate.uark.edu Total Setup = Setup M,N + Setup GPU call Over Head GPU = Setup GPU Call – GPU kernel Over Head Setup = Total Setup – GPU kernel Over Head Main = Total Main program – GPU Kernel
Performance Testing CPU vs. GPU Run on GTX 295 stargate.uark.edu