When training an mmseg project on multiple machines with multiple GPUs, I ran the following commands.
On the first machine:
NNODES=2 NODE_RANK=0 PORT=8888 MASTER_ADDR=192.168.XX.XX sh tools/dist_train.sh ./configs/temp.py 4
On the second machine:
NNODES=2 NODE_RANK=1 PORT=8888 MASTER_ADDR=192.168.XX.XX sh tools/dist_train.sh ./configs/temp.py 4
The error message was:
RuntimeError: The server socket has failed to listen on any local network address. The server socket has failed to bind to [::]:8888 (errno: 98 - Address already in use). The server socket has failed to bind to ?UNKNOWN? (errno: 98 - Address already in use).
The error message shows that port 8888 is already in use on that machine. The fix is simply to change PORT to a free port, for example 29050 or 29051, using the same value on both machines.
With that, the problem is solved!
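Before picking a new PORT, it can help to confirm that the candidate is actually free on the master machine. The snippet below is a minimal sketch, not part of mmseg: `port_is_free` and `first_free_port` are hypothetical helper names, and the check only tries an IPv4 bind using the Python standard library. The result can also go stale if another process grabs the port before training starts.

```python
import socket


def port_is_free(port: int, host: str = "") -> bool:
    """Return True if a TCP socket can currently bind to (host, port).

    Hypothetical helper for illustration; host="" means all IPv4 interfaces.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically errno 98 (Address already in use) on Linux.
            return False
        return True


def first_free_port(candidates):
    """Return the first candidate port that can be bound, or None if none can."""
    for port in candidates:
        if port_is_free(port):
            return port
    return None


if __name__ == "__main__":
    # Try the ports suggested above and a few neighbours.
    print(first_free_port(range(29050, 29060)))
```

Running this on the machine given as MASTER_ADDR prints a port that could be bound at that moment (or None); that value can then be passed as PORT in the launch command on every machine.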