inlinePTX - Using Inline PTX
Description
A simple test application that demonstrates how to embed PTX in a CUDA kernel, a capability introduced in CUDA 4.0.
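To make the idea concrete, the following is a minimal sketch of embedding PTX in a kernel, not a copy of the sample's source. It assumes a one-dimensional launch whose block size is a multiple of 32, so that each thread's lane index equals its global index modulo 32. The names `laneIdKernel` and `countLaneIdMismatches` are hypothetical and chosen for illustration only.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread reads its lane index (its position within the warp) from the
// PTX special register %laneid by means of an inline asm statement.
__global__ void laneIdKernel(int *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        unsigned int laneId;
        // "%%" escapes the percent sign; "=r" binds a 32-bit output register.
        asm("mov.u32 %0, %%laneid;" : "=r"(laneId));
        out[i] = static_cast<int>(laneId);
    }
}

// Hypothetical helper (not part of the sample): runs the kernel on n elements
// and returns the number of entries that differ from i % 32, or -1 if any
// CUDA call fails.
int countLaneIdMismatches(int n) {
    const size_t bytes = static_cast<size_t>(n) * sizeof(int);
    int *dOut = nullptr;
    int *hOut = nullptr;

    if (cudaMalloc(reinterpret_cast<void **>(&dOut), bytes) != cudaSuccess) {
        return -1;
    }
    if (cudaMallocHost(reinterpret_cast<void **>(&hOut), bytes) != cudaSuccess) {
        cudaFree(dOut);
        return -1;
    }

    // Assumption: block size is a multiple of the warp size (32).
    const int blockSize = 256;
    const int gridSize = (n + blockSize - 1) / blockSize;
    laneIdKernel<<<gridSize, blockSize>>>(dOut, n);

    int mismatches = -1;
    if (cudaGetLastError() == cudaSuccess &&
        cudaDeviceSynchronize() == cudaSuccess &&
        cudaMemcpy(hOut, dOut, bytes, cudaMemcpyDeviceToHost) == cudaSuccess) {
        mismatches = 0;
        for (int i = 0; i < n; ++i) {
            if (hOut[i] != i % 32) {
                ++mismatches;
            }
        }
    }

    cudaFreeHost(hOut);
    cudaFree(dOut);
    return mismatches;
}

int main() {
    int mismatches = countLaneIdMismatches(1000);
    if (mismatches < 0) {
        std::printf("CUDA error\n");
        return 1;
    }
    std::printf("mismatches: %d\n", mismatches);
    return mismatches == 0 ? 0 : 1;
}
```

Saved under a hypothetical file name such as `inline_ptx_sketch.cu`, this should build with `nvcc -o inline_ptx_sketch inline_ptx_sketch.cu`. On a working device the host-side check is expected to find no mismatches, since with a block size that is a multiple of 32 the lane index of thread `i` is `i % 32`.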
Key Concepts
Performance Strategies, PTX Assembly, CUDA Driver API
Supported SM Architectures
SM 5.0 SM 5.2 SM 5.3 SM 6.0 SM 6.1 SM 7.0 SM 7.2 SM 7.5 SM 8.0 SM 8.6 SM 8.7 SM 8.9 SM 9.0
Supported OSes
Linux, Windows
Supported CPU Architecture
x86_64, armv7l
CUDA APIs involved
CUDA Runtime API
cudaMemcpy, cudaFree, cudaMallocHost, cudaGetLastError, cudaDeviceSynchronize, cudaFreeHost, cudaMalloc
Prerequisites
Download and install the CUDA Toolkit 12.5 for your corresponding platform.