NVIDIA CUDA Compiler Driver NVCC

NVIDIA CUDA Compiler Driver NVCC

        • [4.2.8.23. `--list-gpu-code` (`-code-ls`)](#4.2.8.23. --list-gpu-code (-code-ls))
        • [4.2.8.24. `--list-gpu-arch` (`-arch-ls`)](#4.2.8.24. --list-gpu-arch (-arch-ls))
  • References

https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/

The documentation for nvcc, the CUDA compiler driver.

CUDA code runs on both the central processing unit (CPU) and graphics processing unit (GPU). NVCC separates these two parts and sends host code (the part of code which will be run on the CPU) to a C compiler like GNU Compiler Collection (GCC) or Intel C++ Compiler (ICC) or Microsoft Visual C++ Compiler, and sends the device code (the part which will run on the GPU) to the GPU. The device code is further compiled by NVCC. NVCC is based on LLVM.

Any source file containing CUDA language extensions (.cu) must be compiled with nvcc.

  • Nvidia CUDA Compiler (NVCC)

Nvidia CUDA Compiler (NVCC) is a compiler by Nvidia intended for use with CUDA.

复制代码
strong@foreverstrong:~$ which nvcc
/usr/local/cuda-8.0/bin/nvcc

C:\Users\cheng>nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>

C:\Users\cheng>nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>

C:\Users\cheng>nvidia-smi
Tue Feb 25 23:59:24 2025
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 528.49       Driver Version: 528.49       CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name            TCC/WDDM | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ... WDDM  | 00000000:01:00.0 Off |                  N/A |
| N/A   44C    P0    22W /  60W |      0MiB /  6144MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

C:\Users\cheng>

nvidia-sminvcc --version 显示的 CUDA Version 差异:

  • nvcc --version refers to the version of the CUDA toolkit installed on the system. The CUDA toolkit includes the compiler, libraries, and other tools. (NVIDIA Toolkit driver)
  • nvidia-smi refers to the CUDA GPU driver version supported by the installed NVIDIA graphics driver. (GPU driver, aka CUDA Driver)

The CUDA driver version must be greater than or equal to the CUDA toolkit version for proper functionality.

复制代码
strong@foreverstrong:~$ ls -l /dev/nv*
crw-rw-rw- 1 root root 195,   0 11月  7 14:59 /dev/nvidia0
crw-rw-rw- 1 root root 195, 255 11月  7 14:59 /dev/nvidiactl
crw-rw-rw- 1 root root 195, 254 11月  7 14:59 /dev/nvidia-modeset
crw-rw-rw- 1 root root 242,   0 11月  7 14:59 /dev/nvidia-uvm
crw-rw-rw- 1 root root 242,   1 11月  7 19:07 /dev/nvidia-uvm-tools
4.2.8.23. --list-gpu-code (-code-ls)

List the non-accelerated gpu architectures (sm_XX) supported by the tool and exit.

If both --list-gpu-code and --list-gpu-arch are set, the list is displayed using the same format as the --generate-code value.

4.2.8.24. --list-gpu-arch (-arch-ls)

List the non-accelerated virtual device architectures (compute_XX) supported by the tool and exit.

If both --list-gpu-arch and --list-gpu-code are set, the list is displayed using the same format as the --generate-code value.

复制代码
nvcc --list-gpu-arch

References

1 Yongqiang Cheng, https://yongqiang.blog.csdn.net/

相关推荐
牛奶3 天前
AI 能赚钱了——但赚的不是你
人工智能·ai编程·nvidia
basketball61618 天前
AI Infra 硬件体系与编程模型:17. CUDA编程基础:底层驱动 API 调用
人工智能·microsoft·nvidia·cuda
basketball61621 天前
AI Infra 硬件体系与编程模型:14. CUDA编程基础:事件与精确性能测量
人工智能·nvidia·cuda
kyle~21 天前
推理部署---CUDA 执行模型(SM、Block、Warp 与 SIMT)
人工智能·nvidia·cuda
basketball61621 天前
AI Infra 硬件体系与编程模型:15. CUDA编程基础:混合精度计算
人工智能·nvidia·cuda
小白狮ww22 天前
3B 参数,毫秒级响应:LocateAnything 如何重新定义开放世界目标检测
人工智能·目标检测·计算机视觉·视觉检测·大语言模型·nvidia·locateanything
kyle~23 天前
ROS 2 与 Isaac Sim 联合仿真(一)体系架构、环境选型与基础通信闭环
c++·机器人·nvidia·仿真·ros2
kyle~23 天前
ROS 2 与 Isaac Sim 联合仿真(三):工程化部署、性能优化、多机器人与 Sim-to-Real
机器人·nvidia·仿真·ros2
CV-deeplearning23 天前
NVIDIA CV-CUDA:GPU 全流程加速计算机视觉,pip 一键安装替代 OpenCV,微软/腾讯/百度/字节全在用,云级图像处理吞吐量飙升 10 倍
opencv·计算机视觉·pip·nvidia·cuda·gpu加速·cv-cuda