宿主機為Ubuntu20.04 + gtx1060,Nvidia driver版本為510.85.02。
安裝環(huán)境為:tensorrt8.4
安裝完成后,一當(dāng)調(diào)用cuda環(huán)境就會報錯:Error 804: forward compatibility was attempted on non supported HW。
檢查問題原因
在Linux宿主機上使用docker(版本>= 19.3)之前,請確保安裝了nvidia-container-runtime和nvidia-container-toolkit:
sudo apt-get install nvidia-container-runtime nvidia-container-toolkit
并且確保nvidia-container-runtime-hook在PATH環(huán)境變量的路徑中:
:~$ which nvidia-container-runtime-hook
/usr/bin/nvidia-container-runtime-hook
cuda初探
既然是個cuda初始化就報錯的問題,那Gemfield不妨先拋開PyTorch,在當(dāng)前的Docker環(huán)境上直接寫一個最簡化的C程序來初始化CUDA設(shè)備,看看是否會出錯。
代碼:
#include <stdio.h>
#include <cuda_runtime.h>
int main() {
int device = 0;
int gpuDeviceCount = 0;
struct cudaDeviceProp properties;
cudaError_t cudaResultCode = cudaGetDeviceCount(&gpuDeviceCount);
if (cudaResultCode == cudaSuccess){
cudaGetDeviceProperties(&properties, device);
printf("%d GPU CUDA devices(s)(%d)\n", gpuDeviceCount, properties.major);
printf("\t Product Name: %s\n" , properties.name);
printf("\t TotalGlobalMem: %ld MB\n" , properties.totalGlobalMem/(1024^2));
printf("\t GPU Count: %d\n" , properties.multiProcessorCount);
printf("\t Kernels found: %d\n" , properties.concurrentKernels);
return 0;
}
printf("\t gemfield error: %d\n",cudaResultCode);
}
編譯:
g++ -I/usr/local/cuda-11.2/targets/x86_64-linux/include/ gemfield.cpp -o gemfield -L/usr/local/cuda-11.2/targets/x86_64-linux/lib/ -lcudart
~# ./gemfield
gemfield error: 804
Error 804: forward compatibility was attempted on non supported HW”,這個錯誤的意思是說:你的硬件不支持forward compatibility。
解決辦法
很簡單,將宿主主機的nvidia顯卡驅(qū)動更新成與鏡像相同的版本,然后再次安裝nvidia-container-runtime和nvidia-container-toolkit:
顯卡驅(qū)動安裝請參考:環(huán)境搭建01——Ubuntu如何查看顯卡信息及安裝NVDIA顯卡驅(qū)動_命名無能的博客-CSDN博客_ubuntu如何查看顯卡驅(qū)動
本文參考
PyTorch的CUDA錯誤:Error 804: forward compatibility was attempted on non supported HW - 知乎文章來源:http://www.zghlxwxcb.cn/news/detail-400886.html
如有侵權(quán),請聯(lián)系刪除。文章來源地址http://www.zghlxwxcb.cn/news/detail-400886.html
到了這里,關(guān)于PyTorch的CUDA錯誤:Error 804: forward compatibility was attempted on non supported HW的文章就介紹完了。如果您還想了解更多內(nèi)容,請在右上角搜索TOY模板網(wǎng)以前的文章或繼續(xù)瀏覽下面的相關(guān)文章,希望大家以后多多支持TOY模板網(wǎng)!