
Testing the feasibility of deploying a large language model on embedded devices: TinyLlama-1.1B-Chat-v1.0

Posted 1 year ago, author: noedn, category: Toy blog


This test runs TinyLlama-1.1B-Chat-v1.0 while varying the inference parameters, to observe how each parameter change affects inference time.
Local environment:

Processor: Intel(R) Core(TM) i5-8400 CPU @ 2.80GHz, 2.80 GHz
Installed RAM: 16.0 GB (15.9 GB usable)
Integrated GPU: Intel(R) UHD Graphics 630
Discrete GPU: NVIDIA GeForce GTX 1050

The main call whose parameters were varied:

outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
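Varying one call by hand gets tedious, so here is a minimal sketch of a timing loop that runs the same prompt through several parameter sets. The names `time_configs`, `CONFIGS`, and `fake_generate` are hypothetical and not part of the original script, and `fake_generate` is only a stand-in so the sketch runs without downloading a model. The parameter sets loosely mirror the variations tried in this post.

```python
import time
from typing import Any, Callable, Dict, List, Tuple


def time_configs(
    generate: Callable[..., Any],
    prompt: str,
    configs: List[Dict[str, Any]],
) -> List[Tuple[Dict[str, Any], float]]:
    """Call generate(prompt, **cfg) once per config and record wall-clock seconds."""
    results = []
    for cfg in configs:
        start = time.perf_counter()
        generate(prompt, **cfg)
        elapsed = time.perf_counter() - start
        results.append((cfg, elapsed))
    return results


# Hypothetical parameter sets, loosely based on the calls in this post.
CONFIGS = [
    {"max_new_tokens": 256, "do_sample": True, "temperature": 0.7, "top_k": 50, "top_p": 0.95},
    {"max_new_tokens": 32, "do_sample": True, "temperature": 0.7, "top_k": 50, "top_p": 0.95},
    {"max_new_tokens": 32, "do_sample": False},
    {"max_new_tokens": 32, "do_sample": True, "temperature": 0.7, "top_k": 30, "top_p": 0.95},
    {"max_new_tokens": 12, "do_sample": True, "temperature": 0.7, "top_k": 50, "top_p": 0.95},
]


def fake_generate(prompt: str, **kwargs: Any) -> str:
    """Stand-in for the real pipeline: sleeps 1 ms per requested token."""
    time.sleep(0.001 * kwargs.get("max_new_tokens", 1))
    return prompt


if __name__ == "__main__":
    for cfg, seconds in time_configs(fake_generate, "How do I open a door?", CONFIGS):
        print(f"{seconds:8.3f}s  {cfg}")
```

To time the real model, pass the `pipe` object in place of `fake_generate`, since the script calls it the same way: `pipe(prompt, **kwargs)`. A single run per configuration is noisy, so repeating each one a few times would give steadier numbers.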

Source code origin (mirror): https://hf-mirror.com/TinyLlama/TinyLlama-1.1B-Chat-v1.0

'''
https://hf-mirror.com/TinyLlama/TinyLlama-1.1B-Chat-v1.0
In testing, TinyLlama 1.1B performs well, much better than even the quantized Qwen1.8B
'''

# Install transformers from source - only needed for versions <= v4.34
# pip install git+https://github.com/huggingface/transformers.git
# pip install accelerate

import os
from datetime import datetime
import torch

os.environ['TF_ENABLE_ONEDNN_OPTS'] = '0'
from transformers import pipeline

'''
pipe = pipeline("text-generation", model="TinyLlama/TinyLlama-1.1B-Chat-v1.0", torch_dtype=torch.bfloat16, device_map="auto")

# We use the tokenizer's chat template to format each message - see https://hf-mirror.com/docs/transformers/main/en/chat_templating
messages = [
    {
        "role": "system",
        "content": "You are a friendly chatbot who always responds in the style of a pirate",
    },
    # {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
    {"role": "user", "content": "你叫什么名字?"},  # "What is your name?"
]
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
'''

# <|system|>
# You are a friendly chatbot who always responds in the style of a pirate.</s>
# <|user|>
# How many helicopters can a human eat in one sitting?</s>
# <|assistant|>
# ...
def load_pipeline():
    pipe = pipeline("text-generation", model="TinyLlama/TinyLlama-1.1B-Chat-v1.0", torch_dtype=torch.bfloat16,
                    device_map="auto")
    return pipe

def generate_text(content, length=20):
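    """
    Generate text from the given prompt.
    """
    messages = [
        {
            # The original used a non-standard role name here. The sample run
            # shows no <|system|> block, which suggests the chat template
            # dropped that message, so the role is set to "system".
            "role": "system",
            "content": "This is a friendly chatbot...",
        },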
        # {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
        {"role": "user", "content": content},
    ]
    prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    datetime1 = datetime.now()
    outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
    print(outputs[0]["generated_text"])
    datetime2 = datetime.now()
    time12_interval = datetime2 - datetime1
    print("Elapsed:", time12_interval)
    # Disabled comparison runs: change to True to time the other parameter sets.
    if False:
        outputs = pipe(prompt, max_new_tokens=32, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
        print(outputs[0]["generated_text"])
        datetime3 = datetime.now()
        time23_interval = datetime3 - datetime2
        print("Elapsed 2:", time23_interval)
        outputs = pipe(prompt, max_new_tokens=32, do_sample=False, top_k=50)
        print(outputs[0]["generated_text"])
        datetime4 = datetime.now()
        time34_interval = datetime4 - datetime3
        print("Elapsed 3:", time34_interval)
        outputs = pipe(prompt, max_new_tokens=32, do_sample=True, temperature=0.7, top_k=30, top_p=0.95)
        print(outputs[0]["generated_text"])
        datetime5 = datetime.now()
        time45_interval = datetime5 - datetime4
        print("Elapsed 4:", time45_interval)
        outputs = pipe(prompt, max_new_tokens=32, do_sample=False, top_k=30)
        print(outputs[0]["generated_text"])
        datetime6 = datetime.now()
        time56_interval = datetime6 - datetime5
        print("Elapsed 5:", time56_interval)
        outputs = pipe(prompt, max_new_tokens=12, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
        print(outputs[0]["generated_text"])
        datetime7 = datetime.now()
        time67_interval = datetime7 - datetime6
        print("Elapsed 6:", time67_interval)

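    '''
    Conclusions: changing top_p does not noticeably reduce inference time, and for the
    same question asked in Chinese and in English, the Chinese version takes about twice as long.
    Setting do_sample to False barely reduces inference time.
    Only max_new_tokens reduces inference time noticeably, but the relationship between
    max_new_tokens and inference time is not linear.
    For example, with max_new_tokens=256 inference takes about 2 minutes,
    while with max_new_tokens=32 it only drops to about 1 minute.
    So it is better to set max_new_tokens higher and get a more complete answer.
    '''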

    return outputs

if __name__ == "__main__":
    '''
    main function
    '''
    # Assigned at module level, so generate_text() can read pipe as a global.
    pipe = load_pipeline()

    # print('load pipe ok')

    while True:
        prompt = input("Enter a prompt (or type 'exit' to quit): ")
        if prompt.lower() == 'exit':
            break
        try:
            generated_text = generate_text(prompt)
            print("Generated text:")
            print(generated_text[0]["generated_text"])
        except Exception as e:
            print("Error:", e)
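The two timings noted in the comments of `generate_text` (about 2 minutes at `max_new_tokens=256`, about 1 minute at `max_new_tokens=32`) can be read as a fixed cost plus a per-token cost. The sketch below fits a straight line through those two points. This is only an assumption for illustration: the figures are rounded, `max_new_tokens` is an upper bound so generation may stop earlier, and `fit_affine` is a hypothetical helper, not part of the original script.

```python
def fit_affine(n1: float, t1: float, n2: float, t2: float) -> tuple:
    """Fit t = intercept + slope * n through two (tokens, seconds) points."""
    slope = (t2 - t1) / (n2 - n1)
    intercept = t1 - slope * n1
    return intercept, slope


# Rounded figures from the post: 32 tokens -> ~60 s, 256 tokens -> ~120 s.
intercept, slope = fit_affine(32, 60.0, 256, 120.0)
print(f"fixed overhead ~ {intercept:.1f} s, per-token cost ~ {slope:.3f} s")
# prints: fixed overhead ~ 51.4 s, per-token cost ~ 0.268 s
```

Under this reading, most of the time in a short run is overhead that does not shrink with `max_new_tokens`, which fits the observation that cutting the limit by 8x only halves the total time.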

Sample run (the prompt 如何開門？ means "How do I open the door?"):
Enter a prompt (or type 'exit' to quit): 如何開門？
<|user|>
如何開門？</s>
<|assistant|>
Certainly! Opening a door is a simple process that involves several steps. Here are the general steps to follow to open a door:

1. Turn off the lock: Turn off the lock with the key by pressing the "lock" button.

2. Press the handle: Use the handle to push the door open. If the door is mechanical, you may need to turn a knob or pull the door handle to activate the door.

3. Release the latch: Once the door is open, release the latch by pulling it backward.

4. Slide the door: Slide the door forward by pushing it against the wall with your feet or using a push bar.

5. Close the door: Once the door is open, close it by pressing the lock button or pulling the handle backward.

6. Use a second key: If the lock has a second key, make sure it is properly inserted and then turn it to the correct position to unlock the door.

Remember to always double-check the locks before opening a door, as some locks can be tricky to open. If you're unsure about the correct procedure for opening a door,
Elapsed: 0:04:23.561065
Generated text:
<|user|>
如何開門？</s>
<|assistant|>
Certainly! Opening a door is a simple process that involves several steps. Here are the general steps to follow to open a door:

1. Turn off the lock: Turn off the lock with the key by pressing the "lock" button.

2. Press the handle: Use the handle to push the door open. If the door is mechanical, you may need to turn a knob or pull the door handle to activate the door.

3. Release the latch: Once the door is open, release the latch by pulling it backward.

4. Slide the door: Slide the door forward by pushing it against the wall with your feet or using a push bar.

5. Close the door: Once the door is open, close it by pressing the lock button or pulling the handle backward.

6. Use a second key: If the lock has a second key, make sure it is properly inserted and then turn it to the correct position to unlock the door.

Remember to always double-check the locks before opening a door, as some locks can be tricky to open. If you're unsure about the correct procedure for opening a door,
Enter a prompt (or type 'exit' to quit):
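The logged interval of 0:04:23.561065 can be turned into a rough throughput figure. The sketch below assumes the run produced the full 256 new tokens, which seems plausible because the answer is cut off mid-sentence, but the log does not confirm it. `parse_elapsed` is a hypothetical helper and only handles intervals shorter than one day.

```python
from datetime import timedelta


def parse_elapsed(text: str) -> timedelta:
    """Parse the 'H:MM:SS.ffffff' form printed for a timedelta under one day."""
    hours, minutes, seconds = text.split(":")
    return timedelta(hours=int(hours), minutes=int(minutes), seconds=float(seconds))


elapsed = parse_elapsed("0:04:23.561065")
assumed_tokens = 256  # assumption: generation ran until max_new_tokens
rate = assumed_tokens / elapsed.total_seconds()
print(f"{elapsed.total_seconds():.1f} s, about {rate:.2f} tokens/s")
# prints: 263.6 s, about 0.97 tokens/s
```

At roughly one token per second on this machine, a 256-token answer takes over four minutes.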
