Getting Started: Cuda C++
This chapter comprises of:
- GPU Intro.
- Transfer Of Data From CPU to GPU.
- GPU Kernels.
- Streaming Multiprocessor and GPU features.
- What are Threads, Blocks and Grids?
- Writing our first program.
- Cuda Error Checking.
- Putting it all together: Vector Addition.
GPU Intro
When a C++ program complied, the program is converted into machine code which can be executed on the CPU. When a program is written for CPU the machine code is excecuted sequentially inside a core. If there are multiple cores present then each core can be used to perform the desired operation parallely(using OpenMp in C++) but the instructions inside each core will be exceuted sequentially.
To read more click the link below. Since writing math using plain html is time consuming, I use jupyter-book for wrtiing my blogs. (To read comfortably please toggle the side bar)
https://yogheswaran-a.github.io/cuda-notes/01-getting-started.html
More Cuda blogs:
https://yogheswaran-a.github.io/cuda-notes/00-landing-page.html