mirror of
https://github.com/NVIDIA/cuda-samples.git
synced 2025-04-24 16:51:22 +01:00
36 lines
1.3 KiB
Markdown
36 lines
1.3 KiB
Markdown
# UnifiedMemoryPerf - Unified and other CUDA Memories Performance
|
|
|
|
## Description
|
|
|
|
This sample demonstrates the performance comparision using matrix multiplication kernel of Unified Memory with/without hints and other types of memory like zero copy buffers, pageable, pagelocked memory performing synchronous and Asynchronous transfers on a single GPU.
|
|
|
|
## Key Concepts
|
|
|
|
CUDA Systems Integration, Unified Memory, CUDA Streams and Events, Pinned System Paged Memory
|
|
|
|
## Supported SM Architectures
|
|
|
|
## Supported OSes
|
|
|
|
Linux, Windows
|
|
|
|
## Supported CPU Architecture
|
|
|
|
x86_64, ppc64le, armv7l, aarch64
|
|
|
|
## CUDA APIs involved
|
|
|
|
### [CUDA Runtime API](http://docs.nvidia.com/cuda/cuda-runtime-api/index.html)
|
|
cudaMemcpy, cudaStreamDestroy, cudaMemPrefetchAsync, cudaFree, cudaMallocHost, cudaMallocManaged, cudaStreamAttachMemAsync, cudaHostGetDevicePointer, cudaFreeHost, cudaStreamSynchronize, cudaMalloc, cudaMemcpyAsync, cudaStreamCreate, cudaGetDeviceProperties
|
|
|
|
## Dependencies needed to build/run
|
|
[UVM](../../../README.md#uvm)
|
|
|
|
## Prerequisites
|
|
|
|
Download and install the [CUDA Toolkit 12.5](https://developer.nvidia.com/cuda-downloads) for your corresponding platform.
|
|
Make sure the dependencies mentioned in [Dependencies]() section above are installed.
|
|
|
|
## References (for more details)
|
|
|