NVIDIA CUDA Compiler Driver NVCC

NVIDIA CUDA Compiler Driver NVCC

        • [4.2.8.23. `--list-gpu-code` (`-code-ls`)](#4.2.8.23. --list-gpu-code (-code-ls))
        • [4.2.8.24. `--list-gpu-arch` (`-arch-ls`)](#4.2.8.24. --list-gpu-arch (-arch-ls))
  • References

https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/

The documentation for nvcc, the CUDA compiler driver.

CUDA code runs on both the central processing unit (CPU) and graphics processing unit (GPU). NVCC separates these two parts and sends host code (the part of code which will be run on the CPU) to a C compiler like GNU Compiler Collection (GCC) or Intel C++ Compiler (ICC) or Microsoft Visual C++ Compiler, and sends the device code (the part which will run on the GPU) to the GPU. The device code is further compiled by NVCC. NVCC is based on LLVM.

Any source file containing CUDA language extensions (.cu) must be compiled with nvcc.

  • Nvidia CUDA Compiler (NVCC)

Nvidia CUDA Compiler (NVCC) is a compiler by Nvidia intended for use with CUDA.

复制代码
strong@foreverstrong:~$ which nvcc
/usr/local/cuda-8.0/bin/nvcc

C:\Users\cheng>nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>

C:\Users\cheng>nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>

C:\Users\cheng>nvidia-smi
Tue Feb 25 23:59:24 2025
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 528.49       Driver Version: 528.49       CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name            TCC/WDDM | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ... WDDM  | 00000000:01:00.0 Off |                  N/A |
| N/A   44C    P0    22W /  60W |      0MiB /  6144MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

C:\Users\cheng>

nvidia-sminvcc --version 显示的 CUDA Version 差异:

  • nvcc --version refers to the version of the CUDA toolkit installed on the system. The CUDA toolkit includes the compiler, libraries, and other tools. (NVIDIA Toolkit driver)
  • nvidia-smi refers to the CUDA GPU driver version supported by the installed NVIDIA graphics driver. (GPU driver, aka CUDA Driver)

The CUDA driver version must be greater than or equal to the CUDA toolkit version for proper functionality.

复制代码
strong@foreverstrong:~$ ls -l /dev/nv*
crw-rw-rw- 1 root root 195,   0 11月  7 14:59 /dev/nvidia0
crw-rw-rw- 1 root root 195, 255 11月  7 14:59 /dev/nvidiactl
crw-rw-rw- 1 root root 195, 254 11月  7 14:59 /dev/nvidia-modeset
crw-rw-rw- 1 root root 242,   0 11月  7 14:59 /dev/nvidia-uvm
crw-rw-rw- 1 root root 242,   1 11月  7 19:07 /dev/nvidia-uvm-tools
4.2.8.23. --list-gpu-code (-code-ls)

List the non-accelerated gpu architectures (sm_XX) supported by the tool and exit.

If both --list-gpu-code and --list-gpu-arch are set, the list is displayed using the same format as the --generate-code value.

4.2.8.24. --list-gpu-arch (-arch-ls)

List the non-accelerated virtual device architectures (compute_XX) supported by the tool and exit.

If both --list-gpu-arch and --list-gpu-code are set, the list is displayed using the same format as the --generate-code value.

复制代码
nvcc --list-gpu-arch

References

1 Yongqiang Cheng, https://yongqiang.blog.csdn.net/

相关推荐
OpenCSG1 天前
Cosmos3:NVIDIA 把世界模型做成了“理解、生成、模拟、行动”的统一入口
人工智能·大模型·nvidia·opencsg
OpenBayes贝式计算3 天前
端侧同尺寸 SOTA:OpenBMB 发布 1B 参数模型 MiniCPM5-1B;集成多级平行语料与多语言词典:SMOL 翻译数据集开源
计算机视觉·google·nvidia
OpenBayes贝式计算3 天前
教程上新丨英伟达开源 LocateAnything,3B 模型可实现图像 + 视频的目标指向 / 开放词汇目标检测/指代表达定位 / OCR 文本定位等功能
计算机视觉·agent·nvidia
视***间4 天前
算力筑基,智领人形机器人新时代 —— 英伟达 × 宇树科技携手推进具身智能,视程空间基于 NVIDIA 全栈算力产品助力机器人产业落地
人工智能·机器人·nvidia·机器狗·gpt-oss·视程空间·宇树机器人
wangruofeng5 天前
NVIDIA RTX Spark 背后,联发科也坐上了主桌
aigc·nvidia
AI科技大本营6 天前
MediaTek 与NVIDIA 合作推出 NVIDIA RTX Spark,驱动下一代 Windows PC 体验
nvidia·mediatek
阿里云大数据AI技术7 天前
基于阿里云 DataWorks Data Agent 进行大模型热度分析
人工智能·agent·nvidia
comedate7 天前
FMT_UNICODE 与 CUDA 编码配置专栏技术文档
c++·utf-8·nvcc
Together_CZ12 天前
Windows 下 CUDA / cuDNN / PyTorch GPU 训练环境搭建完整指南:虚拟环境与系统级环境两种方案
virtualenv·nvidia·cuda·cudnn·rtx 5060
元让_vincent12 天前
论文Review SLAM cuVSLAM | NVIDIA 2025 | CUDA加速的视觉里程计与建图系统
nvidia·视觉slam·gpu加速