CUDA Driver API vs. CUDA runtime
The CUDA runtime makes it possible to compile and link your CUDA kernels into executables. This means that you don’t have to distribute cubin files with your application, or deal with loading them through the driver API. As you have noted, it is generally easier to use. In contrast, the driver API is harder to … Read more