li wei
601384af49
Merge 6aac4717b878ce3a78c1159ff2794e681a914e5b into 8d564d5e3afdab5dda868f42a13d85f3d0e75bc9
2025-02-20 17:35:32 +01:00
Rob Armstrong
940a4c7a91
memMapIpc: Resolve build-time warnings and minor potential issues ( #329 )
...
* Fix compute performance calculation type casting in gpuGetMaxGflopsDeviceIdDRV() for #109
* 3_CUDA_Features/memMapIPCDrv: Increase procIdx buffer size to prevent potential buffer overflow
* memMapIPCDrv: Fix memory leaks and improve header inclusion
- Remove redundant string.h header
- Add memory cleanup for dynamically allocated JIT options and log buffer
- Fix printf format specifier for unsigned long long
2025-02-19 15:52:20 -08:00
shawnz
fb6fcb0110
Enhancement for finding GLFW on WIN and copy .dll files to executable dir for some samples
2025-02-14 22:37:51 +08:00
Rob Armstrong
7a9bd38ecc
Update OpenGL sample build settings on Windows
2025-02-10 23:12:38 -08:00
Rob Armstrong
95308ffc23
Add missing build targets to general samples
2025-02-08 13:04:26 -05:00
Rob Armstrong
56852fbb50
Add explicit system libraries needed on some Linux distributions
2025-02-07 15:43:17 -05:00
Rob Armstrong
e8a041d783
Ensure -Wno-deprecated-gpu-targets is passed to PTX and fatbin compilation steps
2025-02-04 12:07:51 -08:00
Rob Armstrong
c631850c15
Remove compute capability 8.7 build target for non-Tegra builds
2025-02-04 12:01:38 -08:00
Rob Armstrong
9539ca5fa3
Remove compute capability 7.2 build target for non-Tegra builds
2025-02-04 11:58:25 -08:00
Rob Armstrong
0f4bdfad99
CMake: Add '-Wno-deprecated-gpu-targets' to suppress warning messages during build about Maxwell, Pascal, and Volta
2025-02-04 11:51:17 -08:00
shawnz
ab0b386ac4
Update tegra SMs]
2025-01-23 16:20:34 +08:00
shawnz
08fae276b4
Add Tegra SMs in CMakeLists.txt general samples
2025-01-23 11:02:56 +08:00
Rob Armstrong
b203467419
Update CUDA architectures list
2025-01-22 17:49:44 -08:00
XSShawnZeng
95128a1cf3
Update CMakeLists.txt
2025-01-20 10:05:45 +08:00
XSShawnZeng
3848a7c63c
Update CMakeLists.txt
2025-01-20 10:05:18 +08:00
XSShawnZeng
d9633314f8
Update CMakeLists.txt
2025-01-20 10:04:50 +08:00
XSShawnZeng
576c24f97e
Update CMakeLists.txt
2025-01-20 10:04:24 +08:00
XSShawnZeng
970a6a1151
Update CMakeLists.txt
2025-01-20 10:04:02 +08:00
shawnz
76e2d2052c
Update separate SM list for cdp samples and update the watershedSegmentationNPP for build failed of Bug 4668487
2025-01-17 12:04:27 +08:00
Jonathan Bentz
e633580eff
a couple more typos in comments
2025-01-14 18:12:09 -06:00
Jonathan Bentz
df9ea79df4
typo errors in a few CMakelists files
2025-01-14 18:09:26 -06:00
Rob Armstrong
f8fbd04007
Update CMake module search path
2025-01-14 09:14:29 -08:00
Rob Armstrong
ce045e2ae9
Merge branch 'cmake_transition' into aarch64_guarding
2025-01-09 10:03:16 -08:00
Rob Armstrong
f753e86e7a
Update all samples to build position-independent code
2025-01-09 09:59:36 -08:00
Jonathan Bentz
a1cf9e4183
Changes for building for aarch64, specifically jetson orin nano
2025-01-08 17:05:45 -06:00
Jonathan Bentz
f9dd4323af
adding code to Linux-only samples to guard against building on Windows
2024-12-18 14:36:12 -06:00
Rob Armstrong
bfd956bc5e
Remove README references to PPC processors (no longer supported)
2024-12-18 10:54:37 -08:00
Rob Armstrong
0f5821a8c8
Remove outdated build instructions from README.md
2024-12-18 10:52:24 -08:00
Rob Armstrong
abb97e1dfb
Update copy_directory_if_different to copy_directory for CMake 3.20 compatibility
2024-12-17 10:22:02 -08:00
Rob Armstrong
d54d4d7419
Update supported architecture list for some samples
2024-12-16 16:35:23 -08:00
Rob Armstrong
03719b7623
Change remaining build targets to specify active SM variants
2024-12-16 16:17:14 -08:00
Rob Armstrong
090f957854
Refactor CMakeLists.txt under 3_CUDA_Features
2024-12-16 14:37:14 -08:00
Rob Armstrong
2f826e305a
Update ptxjit
2024-12-12 11:34:01 -08:00
Rob Armstrong
9bebdf7ef4
Update jacobiCudaGraphs, memMapIPCDrv, newdelete, simpleCudaGraphs, tf32TensorCoreGemm, warpAggregatedAtomicsCG
2024-12-12 11:29:48 -08:00
Rob Armstrong
6fd8228242
Update graphMemoryFootprint, graphMemoryNodes, immaTensorCoreGemm
2024-12-12 11:19:17 -08:00
Rob Armstrong
76210c84f7
Update cudaCompressibleMemory, cudaTensorCoreGemm, dmmaTensorCoreGemm, globalToShmemAsyncCopy, graphConditionalNodes
2024-12-12 11:17:35 -08:00
Rob Armstrong
62d32b38d7
Update bindlessTexture, CDP samples
2024-12-12 11:11:51 -08:00
Rob Armstrong
dd73281bc6
Update StreamPriorities, bf16TensorCoreGemm, binaryPartitionCG
2024-12-12 11:02:13 -08:00
Rob Armstrong
82bcada84c
Remove now-unnecessary Visual Studio project files
2024-12-11 16:25:06 +00:00
Rob Nertney
9c688d7ff7
Updating samples for CUDA 12.5
2024-07-25 16:30:13 +00:00
aioprli
6aac4717b8
Update globalToShmemAsyncCopy.cu
...
Fix two obvious errors, the first one is that five tasks were submitted to pipeline at the same time and task 4 conflicts with task 0, the remaining two are copy errors
2024-05-06 22:03:16 +08:00
Rob Nertney
5f97d7d0df
Updating graphConditionalNodes orphan directory
2024-04-10 19:44:42 +00:00
Rob Nertney
cd3bc1fa8e
Updating samples for CUDA 12.4
2024-03-05 20:53:50 +00:00
Rob Nertney
e8568c4173
Fixing jitlto regression, including missing cuDLA source files for bug #235 , and updating changelogs
2023-11-09 16:52:00 +00:00
Rob Nertney
b5c84e6996
Updating Samples for 12.3 and updating props files
2023-10-23 18:44:49 +00:00
Rob Nertney
03309a2d42
Changelog updates
2023-06-29 19:33:40 +00:00
Rob Nertney
81cf058e30
Updating Samples for 12.1
2023-03-01 01:41:29 +00:00
Rob Nertney
00bb9bc367
Updating files for Ada architecture
2023-02-27 22:33:19 +00:00
Rob Nertney
2b689228b7
Updating samples for 12.0
2022-12-08 20:19:55 +00:00
Rob Nertney
81992093d2
Update samples for CUDA 11.8 with correct props
2022-10-14 17:43:37 -07:00