[Solved] RuntimeError: CUDA error: invalid device ordinal CUDA kernel errors might be asynchronously repo

2 years ago · Author: 高斯小哥 · Category: Toy博客



Error analysis

Running the following code raises an error:

# self.opt.gpu_ids = ["1"]
torch.cuda.set_device(self.opt.gpu_ids[0])

The error message is as follows:
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call,so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.

The message says that "1" is an invalid device ordinal. However, the machine I am using is a single-node, multi-GPU setup, and it does have a GPU numbered 1.
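One way to investigate this kind of mismatch is to compare the requested ordinal with what the process itself can see, rather than with what the machine physically contains. The sketch below is only an illustration: `parse_visible_devices` and `explain_ordinal` are hypothetical helper names, not part of PyTorch, and the parser assumes `CUDA_VISIBLE_DEVICES` holds a plain comma-separated list. The only PyTorch call used is `torch.cuda.device_count()`, and it is optional.

```python
import os


def parse_visible_devices(environ=None):
    """Return the ids listed in CUDA_VISIBLE_DEVICES, or None if it is unset.

    Simplified assumption: the value is a plain comma-separated list like "0,2".
    """
    environ = os.environ if environ is None else environ
    raw = environ.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return None  # unset: the process is not restricted
    return [tok.strip() for tok in raw.split(",") if tok.strip()]


def explain_ordinal(ordinal, device_count, visible=None):
    """Describe whether `ordinal` is usable when `device_count` GPUs are visible."""
    if 0 <= ordinal < device_count:
        return f"ordinal {ordinal} is valid ({device_count} device(s) visible)"
    hint = "" if visible is None else f"; CUDA_VISIBLE_DEVICES={','.join(visible)}"
    if device_count == 0:
        return f"ordinal {ordinal} is invalid: no CUDA devices are visible{hint}"
    return (f"ordinal {ordinal} is invalid: valid range is "
            f"0..{device_count - 1}{hint}")


if __name__ == "__main__":
    try:
        import torch  # optional: only needed for the live check
    except ImportError:
        torch = None
    if torch is not None:
        # Compare the ordinal from the failing call with what this process sees.
        print(explain_ordinal(1, torch.cuda.device_count(), parse_visible_devices()))
```

If the reported device count is smaller than the number of GPUs shown by `nvidia-smi`, something has restricted the process, and the environment variable is the first place to look.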

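Solution

Reviewing the code that runs before the failing line, especially the imports at the top of the program, showed that the os module had been used to restrict which GPUs (and how many) are visible to the program: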

import os
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

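With CUDA_VISIBLE_DEVICES set to "1", the process can see only that one GPU, and inside the process it is renumbered as device 0, so ordinal 1 no longer exists. After commenting out os.environ["CUDA_VISIBLE_DEVICES"] = "1" and rerunning the program, the bug was resolved.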
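An alternative to removing the environment variable is to keep it and translate the physical GPU id into the index the process actually sees. The sketch below is not the fix used in this post. `logical_index` is a hypothetical helper, and it assumes `CUDA_VISIBLE_DEVICES` is either unset or a plain comma-separated list of integer ids (UUID entries are not handled).

```python
import os


def logical_index(physical_id, environ=None):
    """Map a physical GPU id to the device index visible inside this process.

    Assumption: CUDA_VISIBLE_DEVICES is unset or a comma-separated list of ints.
    """
    environ = os.environ if environ is None else environ
    raw = environ.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return physical_id  # no restriction, so no renumbering
    visible = [int(tok) for tok in raw.split(",") if tok.strip()]
    if physical_id not in visible:
        raise ValueError(
            f"GPU {physical_id} is hidden by CUDA_VISIBLE_DEVICES={raw!r}"
        )
    # Visible devices are renumbered from 0 in the order they are listed.
    return visible.index(physical_id)
```

With CUDA_VISIBLE_DEVICES set to "1", `logical_index(1)` evaluates to 0, so `torch.cuda.set_device(logical_index(1))` would select the only visible GPU instead of raising the invalid device ordinal error. Note that the physical numbering used by CUDA may differ from the order shown by `nvidia-smi` unless the device order is pinned, so treat the mapping as an assumption to verify on your machine.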


