国产无码综合区,色欲AV无码国产永久播放,无码天堂亚洲国产AV,国产日韩欧美女同一区二区

快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

2年前作者：敷衍zgf分類：Toy博客閱讀(22)違法舉報(bào)

這篇具有很好參考價(jià)值的文章主要介紹了快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM。希望對大家有所幫助。如果存在錯(cuò)誤或未考慮完全的地方，請大家不吝賜教，您也可以點(diǎn)擊"舉報(bào)違法"按鈕提交疑問。

快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

本項(xiàng)目采用HuggingFace提供的工具實(shí)現(xiàn)BERT模型案例，并在BERT后接CNN、LSTM等
HuggingFace官網(wǎng)
快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

一、實(shí)現(xiàn)BERT（后接線性層）

1.引用案例源碼：

from transformers import BertTokenizer, BertModel
import torch
model_name = 'bert-base-uncased'

tokenizer = BertTokenizer.from_pretrained(model_name)
model = BertModel.from_pretrained(model_name)

inputs = tokenizer("Hello, my dog is cute", return_tensors="pt" , padding='max_length',max_length=10)
outputs = model(**inputs)
# print(inputs) # 字典類型的input_ids必選字段  101CLS  102SEP
last_hidden_states = outputs.last_hidden_state  # last_hidden_state 最后一層的輸出  pooler_output / hidden_states

快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM
程序會(huì)自行下載模型和配置文件，也可自行在官網(wǎng)上手動(dòng)下載

模型返回的參數(shù)

2. 自定義類調(diào)用數(shù)據(jù)集

class MyDataSet(Data.Dataset) :
    def __init__(self , data ,label):
        self.data = data # ['今天天氣很好', 'xxxx' , ……]
        self.label = label # [1 , 2 , 0]
        self.tokenizer = BertTokenizer.from_pretrained(model_name)
    def __getitem__(self , idx):
        text = self.data[idx] # str
        label = self.label[idx]
        inputs = self.tokenizer(text , return_tensors="pt" , padding='max_length',max_length=10 , truncation = True)  # truncation = True 是否進(jìn)行截?cái)嗖僮?/span>
        input_ids = inputs.input_ids.squeeze(0) # squeeze(0) 對張量進(jìn)行降維操作 為啥降維：輸入的data是一句話（一維）但生成的input_ids默認(rèn)是二維，因此需要降維
        token_type_ids = inputs.token_type_ids.squeeze(0)
        attention_mask = inputs.attention_mask.squeeze(0)
        return input_ids , token_type_ids , attention_mask,label
    def __len__(self):
        return len(self.data)

squeeze(0)的作用： 舉個(gè)栗子

input_ids: tensor([[ 101, 7592, 1010, 2026, 3899, 2003, 10140, 102, 0, 0]])
b = input_ids.squeeze(0)
b: tensor([ 101, 7592, 1010, 2026, 3899, 2003, 10140, 102, 0, 0])
當(dāng)張量是一個(gè)1 * n 維度的張量時(shí)，input_ids的維度是 1 * 10，調(diào)用squeeze(0) 將張量降成1維。
若不是1 * n的這種2維張量，如本就是1維的，或者m * n（其中m和n都是大于1的），調(diào)用這個(gè)函數(shù)沒有效果。

squeeze(1)和squeeze(-1)作用相同 ，與squeeze(0)類似
將一個(gè)n*1維度的張量降成1維

3. 將數(shù)據(jù)集傳入模型

data , label = [] , []
with open('./dataset/data.txt') as f:
    for line in f.readlines():
        a, b = line.strip().split(' ')
        data.append(a)
        label.append(int(b))
dataset = MyDataSet(data,label)
dataloader = Data.DataLoader(dataset , batch_size = 2,shuffle =True)

4.自定義模型

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel,self).__init__()
        self.bert = BertModel.from_pretrained(model_name)
        self.liner = nn.Linear(768, 3)  # "hidden_size": 768
    
    def forward(self , input_ids , token_type_ids , attention_mask) :
        output = self.bert(input_ids , token_type_ids ,attention_mask).pooler_output # [batch_size , hidden_size]
        # print(output.shape)
        output = self.linnear(output)
        return output

5.配置運(yùn)行環(huán)境

device = torch.device('cuda')
model = MyModel().to(device)
loss_fun = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters() , lr = 1e-5)

6.訓(xùn)練模型，計(jì)算損失

for epoch in range(10):
    for input_ids,token_type_ids,attention_mask ,label in dataloader:
        input_ids,token_type_ids,attention_mask ,label = input_ids.to(device),token_type_ids.to(device),attention_mask.to(device) ,label.to(device)
        pred = model(input_ids,token_type_ids,attention_mask)
        
        loss = loss_fn(pred , label)
        print(loss.item())
        
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM
易出現(xiàn)顯存不夠的錯(cuò)誤，可以在服務(wù)器控制臺中輸入nvidia-smi //查看所有進(jìn)程信息

選擇需要?dú)⑺赖倪M(jìn)程taskkill /PID PTD號 /F //使用taskkill 進(jìn)程id，殺死進(jìn)程
快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

二、BERT+CNN

添加卷積層，查看需要的參數(shù)
快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

輸入層和輸出層之間的參數(shù)關(guān)系為：W_out = （W_in + 2p - w）/ s +1 ； H_out = （H_in + 2p - w）/ s +1
其中W_in = maxlen，H_in = hidden_size，卷積核大小為 w * h ，p表示padding（默認(rèn)為0），s表示卷積步長（默認(rèn)為1）
因此輸出為（10 + 2 * 0 - 2）/ 1 + 1 = 9 ，（768 + 2 * 0 - 768）/ 1 + 1 = 1

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.bert = BertModel.from_pretrained(model_name)

        '''
        BERT后接CNN
        '''
        self.conv = nn.Conv2d(1 ,3 ,kernel_size=(2,768)) # in_channels 輸入的通道數(shù) out_channels 經(jīng)過卷積之后的通道數(shù) kernel_size 卷積核大小
        self.linear = nn.Linear(27, 3)  # "hidden_size": 768

    def forward(self, input_ids, token_type_ids, attention_mask):
        '''

        x : [batch , channel , width , height]

        '''
        batch = input_ids.size(0)
        output = self.bert(input_ids, token_type_ids, attention_mask).last_hidden_state  # [batch_size , seq , hidden_size]
        output = output.unsqueeze(1) # [batch , 1, seq , hidden_size] 三維擴(kuò)展成四維
        # print(output.shape)
        output = self.conv(output) # [batch , 3, 9 ,1]
        print(output.shape)
        # 為了進(jìn)行分類，希望將四維轉(zhuǎn)換成二維 # [batch , 3]
        output = output.view(batch , -1) # [batch , 3*9*1]

        output = self.linear(output)
        return output

輸出結(jié)果

torch.Size([2, 3, 9, 1])
1.0467640161514282
torch.Size([1, 3, 9, 1])
1.6651103496551514
torch.Size([2, 3, 9, 1])
1.1516715288162231
torch.Size([1, 3, 9, 1])
1.0645687580108643
torch.Size([2, 3, 9, 1])
1.0910512208938599
torch.Size([1, 3, 9, 1])
0.9897172451019287
torch.Size([2, 3, 9, 1])
1.0313527584075928
torch.Size([1, 3, 9, 1])
1.0067516565322876
torch.Size([2, 3, 9, 1])
0.9847115278244019
torch.Size([1, 3, 9, 1])
1.01240873336792
torch.Size([2, 3, 9, 1])
0.9597381353378296
torch.Size([1, 3, 9, 1])
0.9435619115829468
torch.Size([2, 3, 9, 1])
0.9591015577316284
torch.Size([1, 3, 9, 1])
0.8384571075439453
torch.Size([2, 3, 9, 1])
0.9722234010696411
torch.Size([1, 3, 9, 1])
0.7264331579208374
torch.Size([2, 3, 9, 1])
0.9841375350952148
torch.Size([1, 3, 9, 1])
0.6240622997283936
torch.Size([2, 3, 9, 1])
0.7659112811088562
torch.Size([1, 3, 9, 1])
1.0371975898742676

三、BERT+LSTM

添加LSTM，查看需要哪些參數(shù)
快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.bert = BertModel.from_pretrained(model_name)
        '''
        BERT后接LSTM
        '''
        self.lstm = nn.LSTM(input_size=768, hidden_size= 512 ,num_layers= 1 , batch_first= True , bidirectional=True) # batch_first = True 表示輸入輸出順序(batch,seq,feature) LSTM默認(rèn)(seq,batch,feature)
        self.linear = nn.Linear(1024, 3)  # "hidden_size": 768

    def forward(self, input_ids, token_type_ids, attention_mask):
        '''
        x : [batch , seq]
        '''
        batch = input_ids.size(0)
        output = self.bert(input_ids, token_type_ids, attention_mask).last_hidden_state  # [batch_size , seq , hidden_size]
        output , _ = self.lstm(output)
        print(output.shape) # [2 , seq ,2*hidden_size]

        # 使用LSTM最后一層的輸出做分類
        output = output[: ,-1,:] # [batch , 1024]
        print('最后一層' ,output.shape)
        output = self.linear(output)
        return output

輸出結(jié)果

{‘input_ids’: tensor([[ 101, 7592, 1010, 2026, 3899, 2003,
10140, 102, 0, 0]]), ‘token_type_ids’: tensor([[0, 0, 0, 0,
0, 0, 0, 0, 0, 0]]), ‘a(chǎn)ttention_mask’: tensor([[1, 1, 1, 1, 1, 1, 1,
1, 0, 0]])} [‘今天天氣很好’, ‘今天天氣很不好’, ‘明天天氣非常好’] [1, 0, 2]

torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.0788244009017944 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.3834939002990723 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.155088186264038 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.0809415578842163 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.061639666557312 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.1302376985549927 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.0572789907455444 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.086378812789917 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.0700803995132446 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.0184061527252197 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
0.9948051571846008 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.203598976135254 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.1068116426467896 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
0.9117098450660706 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
0.9891176223754883 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.1974778175354004 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
1.0810655355453491 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
0.8861477375030518 torch.Size([2, 10, 1024]) 最后一層 torch.Size([2, 1024])
0.9180283546447754 torch.Size([1, 10, 1024]) 最后一層 torch.Size([1, 1024])
1.2292695045471191

要實(shí)現(xiàn)BERT后接各種模型，最重要的是需要知道模型經(jīng)過每一層后的維度是多少，最粗暴的方式可以通過print輸出維度。文章來源地址http://www.zghlxwxcb.cn/news/detail-454831.html

到了這里，關(guān)于快速上手Pytorch實(shí)現(xiàn)BERT，以及BERT后接CNN/LSTM的文章就介紹完了。如果您還想了解更多內(nèi)容，請?jiān)谟疑辖撬阉鱐OY模板網(wǎng)以前的文章或繼續(xù)瀏覽下面的相關(guān)文章，希望大家以后多多支持TOY模板網(wǎng)！

本文來自互聯(lián)網(wǎng)用戶投稿，該文觀點(diǎn)僅代表作者本人，不代表本站立場。本站僅提供信息存儲空間服務(wù)，不擁有所有權(quán)，不承擔(dān)相關(guān)法律責(zé)任。如若轉(zhuǎn)載，請注明出處：如若內(nèi)容造成侵權(quán)/違法違規(guī)/事實(shí)不符，請點(diǎn)擊違法舉報(bào)進(jìn)行投訴反饋，一經(jīng)查實(shí)，立即刪除！

分享到：

領(lǐng)支付寶紅包贊助服務(wù)器費(fèi)用

基于Pytorch框架的CNN-LSTM模型在CWRU軸承故障診斷的應(yīng)用
目錄 1. 簡介 2. 方法 2.1數(shù)據(jù)集 2.2模型架構(gòu) 1. 簡介 CWRU軸承故障診斷是工業(yè)領(lǐng)域一個(gè)重要的問題，及早發(fā)現(xiàn)軸承故障可以有效地減少設(shè)備停機(jī)時(shí)間和維修成本，提高生產(chǎn)效率和設(shè)備可靠性。傳統(tǒng)的基于信號處理和特征提取的方法通常需要手工設(shè)計(jì)特征，這在某些情況下可能無法
2024年04月15日
瀏覽(26)
人工智能(pytorch)搭建模型16-基于LSTM+CNN模型的高血壓預(yù)測的應(yīng)用
大家好，我是微學(xué)AI，今天給大家介紹一下人工智能(pytorch)搭建模型16-基于LSTM+CNN模型的高血壓預(yù)測的應(yīng)用，LSTM+CNN模型搭建與訓(xùn)練，本項(xiàng)目將利用pytorch搭建LSTM+CNN模型，涉及項(xiàng)目：高血壓預(yù)測，高血壓是一種常見的性疾病，早期預(yù)測和干預(yù)對于防止其發(fā)展至嚴(yán)重疾病至關(guān)重要
2024年02月12日
瀏覽(102)
最強(qiáng)Python開源庫PyTorch入門實(shí)戰(zhàn)(案例實(shí)戰(zhàn))+快速上手TorchServe
作者：禪與計(jì)算機(jī)程序設(shè)計(jì)藝術(shù) 在過去幾年里，深度學(xué)習(xí)領(lǐng)域涌現(xiàn)了一大批高水平的模型，這些模型基于大量的數(shù)據(jù)和GPU計(jì)算能力實(shí)現(xiàn)了炫酷的效果。這其中最具代表性的是卷積神經(jīng)網(wǎng)絡(luò)（Convolutional Neural Networks, CNN），其網(wǎng)絡(luò)結(jié)構(gòu)可以學(xué)習(xí)到圖像、視頻、文本等多種模態(tài)特
2024年02月07日
瀏覽(28)
分類預(yù)測 | MATLAB實(shí)現(xiàn)SCNGO-CNN-LSTM-Attention數(shù)據(jù)分類預(yù)測
分類效果基本描述 1.SCNGO-CNN-LSTM-Attention數(shù)據(jù)分類預(yù)測程序，改進(jìn)算法，融合正余弦和折射反向?qū)W習(xí)的北方蒼鷹優(yōu)化算法； 2.程序平臺：無Attention適用于MATLAB 2020版及以上版本；融合Attention要求Matlab2023版以上； 3.基于融合正余弦和折射反向?qū)W習(xí)的北方蒼鷹優(yōu)化算法（SCNGO）、卷
2024年02月11日
瀏覽(30)
基于貝葉斯優(yōu)化CNN-LSTM混合神經(jīng)網(wǎng)絡(luò)預(yù)測（Matlab代碼實(shí)現(xiàn)）
?? ?? ?? ?? 歡迎來到本博客 ?? ?? ?? ?? ?? 博主優(yōu)勢： ?? ?? ??博客內(nèi)容盡量做到思維縝密，邏輯清晰，為了方便讀者。 ? 座右銘：行百里者，半于九十。 ?? ?? ?? 本文目錄如下： ?? ?? ?? 目錄 ??1 概述 ??2 運(yùn)行結(jié)果 ??3 參考文獻(xiàn) ??4 Matlab代碼實(shí)現(xiàn) 參
2024年02月02日
瀏覽(26)
回歸預(yù)測 | MATLAB實(shí)現(xiàn)CNN-LSTM-Attention多輸入單輸出回歸預(yù)測
預(yù)測效果基本介紹 MATLAB實(shí)現(xiàn)CNN-LSTM-Attention多輸入單輸出回歸預(yù)測，CNN-LSTM結(jié)合注意力機(jī)制多輸入單輸出回歸預(yù)測。模型描述 Matlab實(shí)現(xiàn)CNN-LSTM-Attention多變量回歸預(yù)測 1.data為數(shù)據(jù)集，格式為excel，7個(gè)輸入特征，1個(gè)輸出特征； 2.MainCNN_LSTM_Attention.m為主程序文件，運(yùn)行即可； 3.命
2024年02月06日
瀏覽(25)
時(shí)間序列預(yù)測 — CNN-LSTM-Attention實(shí)現(xiàn)多變量負(fù)荷預(yù)測(Tensorflow)：多變量滾動(dòng)
???專欄鏈接： https://blog.csdn.net/qq_41921826/category_12495091.html 專欄內(nèi)容 ??所有文章提供源代碼、數(shù)據(jù)集、效果可視化 ??文章多次上領(lǐng)域內(nèi)容榜、每日必看榜單、全站綜合熱榜 ? ? ?
2024年01月23日
瀏覽(19)
【人工智能】Transformers 快速上手: 為 Jax、PyTorch 和 TensorFlow 打造的先進(jìn)的自然語言處理
為 Jax、PyTorch 和 TensorFlow 打造的先進(jìn)的自然語言處理 ?? Transformers 提供了數(shù)以千計(jì)的預(yù)訓(xùn)練模型，支持 100 多種語言的文本分類、信息抽取、問答、摘要、翻譯、文本生成。它的宗旨是讓最先進(jìn)的 NLP 技術(shù)人人易用。 ?? Transformers 提供了便于快速下載和使用的API，讓你可以把
2024年02月08日
瀏覽(31)
LSTM實(shí)現(xiàn)時(shí)間序列預(yù)測(PyTorch版)
??項(xiàng)目專欄：【深度學(xué)習(xí)時(shí)間序列預(yù)測案例】零基礎(chǔ)入門經(jīng)典深度學(xué)習(xí)時(shí)間序列預(yù)測項(xiàng)目實(shí)戰(zhàn)（附代碼+數(shù)據(jù)集+原理介紹）
2023年04月24日
瀏覽(27)
時(shí)序預(yù)測 | MATLAB實(shí)現(xiàn)CNN-LSTM卷積長短期記憶神經(jīng)網(wǎng)絡(luò)時(shí)間序列預(yù)測（風(fēng)電功率預(yù)測）
預(yù)測效果基本介紹 1.MATLAB實(shí)現(xiàn)CNN-LSTM卷積長短期記憶神經(jīng)網(wǎng)絡(luò)時(shí)間序列預(yù)測（風(fēng)電功率預(yù)測）； 2.運(yùn)行環(huán)境為Matlab2021b； 3.單個(gè)變量時(shí)間序列預(yù)測； 4.data為數(shù)據(jù)集，單個(gè)變量excel數(shù)據(jù)，MainCNN_LSTMTS.m為主程序，運(yùn)行即可,所有文件放在一個(gè)文件夾； 5.命令窗口輸出R2、MSE、RMSE、
2024年02月10日
瀏覽(35)

<strike id="h4bxc"><strike id="h4bxc"></strike></strike>

<strike id="h4bxc"><dl id="h4bxc"></dl></strike>