记录一次深度学习+SSH的配置

太久没碰深度学习了...全忘了主包重新来吧，自用。

一.本次配置的是dfine模型的环境：

在Vscode+远程上训练：

一.首先anaconda创建一个专属的虚拟环境：

python 复制代码

#创建虚拟环境 
conda create -n dfine python=3.12
conda activate dfine
 
# 安装核心依赖
pip install torch>=2.0.1 torchvision>=0.15.2 -i https://pypi.tuna.tsinghua.edu.cn/simple

pip install faster-coco-eval>=1.6.6 PyYAML tensorboard scipy
pip install calflops transformers loguru

python 复制代码

# ONNX相关工具
pip install onnx onnxsim onnxruntime
 
# TensorRT部署（需要CUDA环境）
pip install tensorrt
 
# OpenVINO部署  
pip install openvino
 
# 可视化工具
pip install fiftyone

-i https://pypi.tuna.tsinghua.edu.cn/simple

去网站：D-FINE:D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥 - GitCode

点击clone。

python 复制代码

cd D-FINE

//cd到刚刚clone的zip目录里，然后：
pip install -r requirements.txt

到这一步，接下来：验证

PyTorch 能否调用 GPU（CUDA 路径正确否）

python 复制代码

python -c "import torch, platform, cv2; print('Python', platform.python_version()); print('Torch', torch.__version__); print('CUDA available', torch.cuda.is_available()); print('cuDNN ver', torch.backends.cudnn.version())"

结果是：

python 复制代码

Python 3.12.11
Torch 2.8.0+cpu
CUDA available False
cuDNN ver None

说明：你装的是 CPU 版 PyTorch ，所以 CUDA/cuDNN 全是 None。

在 D-FINE 里训练或大图推理会很慢，必须换成 GPU 版 wheel。

因此我们需要：

两步切到 GPU 版（保持清华源加速）

你装的是 CPU 版 PyTorch ，所以 CUDA/cuDNN 全是 None。

在 D-FINE 里训练或大图推理会很慢，必须换成 GPU 版 wheel。

两步切到 GPU 版（保持清华源加速）

先卸载 CPU 版

pip uninstall -y torch torchvision

2.按你的驱动版本选一条命令（驱动 ≥ 525 就用 CUDA 12，否则用 11.8）：

这里通过命令：查看自己的cuda版本，我的是12.9.

python 复制代码

nvidia-smi

接下来去PyTorch

python 复制代码

pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu128

因为官网目前还没有12.9的轮子，所以用下面这条指令：

|---------------|---------------|
| CUDA 12.1 | CUDA 11.8 |

复制代码

pip install torch==2.1.0 torchvision==0.16.0 --index-url https://download.pytorch.org/whl/cu121

python 复制代码

pip install torch==2.1.0 torchvision==0.16.0 --index-url https://download.pytorch.org/whl/cu118

最后我选用了这个：

python 复制代码

pip install torch==2.2.0 torchvision==0.17.0 --index-url https://download.pytorch.org/whl/cu118

注意：

用 官方 CUDA wheel 即可，conda cudatoolkit 会自动被 PyTorch 动态链接，无需额外装。

版本号与 D-FINE 代码兼容，2.1.0 是目前最稳的"万金油"。

装完再验证：

python 复制代码

python -c "import torch, torchvision, platform; print('Python', platform.python_version()); print('Torch', torch.__version__); print('CUDA available', torch.cuda.is_available()); print('cuDNN', torch.backends.cudnn.version())"

现在看到：

python 复制代码

Python 3.12.11
Torch 2.2.0+cu118
CUDA available True
cuDNN 8700

同时有警告：

PyTorch 2.2.0 是用 NumPy 1.x 编译的 ，而你环境里现在是 NumPy 2.2.6 ，ABI 不兼容，导致 PyTorch 在 import 时报 _ARRAY_API not found 的警告（目前只是警告，但随时可能崩）。

✅ 最稳解决方案：把 NumPy 降级到 1.x

python 复制代码

pip install "numpy<2" --force-reinstall

执行完这条语句后验证：

python 复制代码

python -c "import numpy, torch; print('NumPy', numpy.__version__); print('Torch', torch.__version__)"

python 复制代码

NumPy 1.26.4
Torch 2.2.0+cu118

完美！

现在 NumPy 1.26.4 + PyTorch 2.2.0+cu118 已兼容，警告消失，GPU 也能正常调用。

此时第一步，即虚拟环境已经配备完成。

Python 3.12 + PyTorch 2.2.0+cu118 + NumPy 1.26 兼容
GPU 可被 torch 调用（CUDA True，cuDNN 8700）

二.下载权重并且运行demo：

下载权重

官方 GitHub Release 页面：
https://github.com/Peterande/D-FINE/releases

选 dfine_swin_tiny.pth（最小、最快），放到项目根目录即可。

未完待续。。。