Index
A
Amdahl’s Law 101
asynchronous data transfers 86-89
asynchronous memory transfers 185
B
block 36, 82, 83
size 84
blocking call 68
breakpoints 173
C
C++ library 198
creating 200-203
integrating, with Python 198, 199
C++ Standard Template Library (STL) 218
CI/CD pipeline 230
CMake 67
code
GTest, using with 228-231
Pytest, using with 232-235
writing 222-225
code optimization 126
compiler directives 155
conditional breakpoints 176
context switching 7
convolution 120
sensor data, processing 120-122
CPUs 16
critical path 104
Ctypes
using 203-205
cuBLAS 218
matrices, multiplying 218-220
CUDA Capability specification 42
CUDA code
debugging, with VS Code 170-179
CUDA cores 11
CUDA Events 68
cuda-gdb 170
reference link 170
CUDA streams
limits, measuring 182, 183
matrices, multiplying 183-193
operations, overlaying...