解决CUDA driver version is insufficient for CUDA runtime version
问题:
在docker跑训练insightface的Oneflow代码时,出现报错
Failed to get cuda runtime version: CUDA driver version is insufficient for CUDA runtime version
原因:
1、查看CUDA runtime version
cat /usr/local/cuda/version.txt
得到我的docker里的CUDA版本是10.0.130
CUDA Version 10.0.130
2、CUDA版本对显卡驱动版本有要求,见如下链接。 https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html
CUDA Toolkit | Linux x86_64 Driver Version | Windows x86_64 Driver Version |
---|
CUDA 11.0.3 Update 1 | >= 450.51.06 | >= 451.82 | CUDA 11.0.2 GA | >= 450.51.05 | >= 451.48 | CUDA 11.0.1 RC | >= 450.36.06 | >= 451.22 | CUDA 10.2.89 | >= 440.33 | >= 441.22 | CUDA 10.1 (10.1.105 general release, and updates) | >= 418.39 | >= 418.96 | CUDA 10.0.130 | >= 410.48 | >= 411.31 | CUDA 9.2 (9.2.148 Update 1) | >= 396.37 | >= 398.26 | CUDA 9.2 (9.2.88) | >= 396.26 | >= 397.44 |
cat /proc/driver/nvidia/version看了一下服务器的显卡驱动是 418.67,应该安装CUDA 10.1,而我装的是10.0.130的cuda。
NVRM version: NVIDIA UNIX x86_64 Kernel Module 418.67 Sat Apr 6 03:07:24 CDT 2019
GCC version: gcc version 7.3.0 (Ubuntu 7.3.0-27ubuntu1~18.04)
解决:
安装CUDA 10.1
(1)首先在https://developer.nvidia.com/cuda-toolkit-archive中根据机子环境,下载相应的cuda10.1安装文件,Installer Type我选的是runfile(local),安装步骤会简单点。
wget https://developer.download.nvidia.com/compute/cuda/10.1/Prod/local_installers/cuda_10.1.243_418.87.00_linux.runsudo sh
(2)安装
sh cuda_10.1.243_418.87.00_linux.run
出现同样的错误,未解决! 待后续找到解决办法再更新。
|