Related articles in this series:
Fine-tuning
- Large language models - ChatGLM-Tuning
- Large language models - fine-tuning ChatGLM-6B
- Large language models - Chinese ChatGLM/LLaMA fine-tuning
- Large language models - alpaca-lora
Local knowledge base
- Large language models 2 - document AI explained
- Large language models - DocumentSearch explained
- Large language models - Chinese Langchain
The code discussed in this article is at:
https://github.com/27182812/ChatGLM-LLaMA-chinese-insturct
Chinese instruction data fine-tuning on ChatGLM and LLaMA.
Data
JSON preprocessing
- instruction
- tokenizer
Compared with the earlier article, Large language models - ChatGLM-Tuning, both of these functions are here placed inside a single class in dataprocess; at first glance, the changes needed are almost the same.
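For reference, here is a minimal sketch of what such a dataprocess-style class might look like, assuming the JSON records follow the Alpaca-style instruction/input/output layout; the class and method names are hypothetical and not the repo's actual code.

import json

class DataProcess:
    """Hypothetical sketch: groups prompt building and tokenization in one class."""

    def __init__(self, path, tokenizer, cutoff_len=256):
        # Each record is assumed to look like
        # {"instruction": ..., "input": ..., "output": ...}
        self.records = json.load(open(path, encoding="utf-8"))
        self.tokenizer = tokenizer
        self.cutoff_len = cutoff_len

    def build_prompt(self, record):
        # Concatenate instruction (and optional input) into one prompt string.
        if record.get("input"):
            return f'{record["instruction"]}\n{record["input"]}\n'
        return f'{record["instruction"]}\n'

    def encode(self, record):
        # Tokenize prompt + answer, truncated to the cutoff length.
        text = self.build_prompt(record) + record["output"]
        return self.tokenizer(text, truncation=True, max_length=self.cutoff_len)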
Fine-tuning
- For ChatGLM: finetune.sh
- For LLaMA: test_llama1.py
For ChatGLM the procedure is almost the same as in the earlier article, so the focus here is on LLaMA.
Data
def generate_prompt(data_point):
    # sorry about the formatting disaster gotta move fast
    if data_point["input"]:
        return f"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{data_point["instruction"]}
### Input:
{data_point["input"]}
### Response:
{data_point["output"]}"""
    else:
        return f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
{data_point["instruction"]}
### Response:
{data_point["output"]}"""


def tokenize(prompt):
    # there's probably a way to do this with the tokenizer settings
    # but again, gotta move fast
    result = tokenizer(
        prompt,
        truncation=True,
        max_length=CUTOFF_LEN + 1,
        padding="max_length",
    )
    return {
        "input_ids": result["input_ids"][:-1],
        "attention_mask": result["attention_mask"][:-1],
    }
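To make the data format concrete, here is a short usage sketch (the sample record is invented for illustration, and tokenizer / CUTOFF_LEN are assumed to be defined as in the training script below). Note that tokenize pads and truncates to CUTOFF_LEN + 1 and then drops the last token, so every example comes out exactly CUTOFF_LEN tokens long.

# Hypothetical sample record, for illustration only.
sample = {
    "instruction": "把下面的句子翻譯成英文。",
    "input": "今天天氣很好。",
    "output": "The weather is nice today.",
}

prompt = generate_prompt(sample)   # Alpaca-style prompt that ends with the answer
features = tokenize(prompt)        # {"input_ids": [...], "attention_mask": [...]}
print(len(features["input_ids"]))  # == CUTOFF_LEN, after the trailing token is dropped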
Model
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import LoraConfig, get_peft_model, prepare_model_for_int8_training

model = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",
    load_in_8bit=True,
    device_map="auto",
)
tokenizer = LlamaTokenizer.from_pretrained(
    "decapoda-research/llama-7b-hf", add_eos_token=True
)

# Freeze the base model and prepare it for 8-bit LoRA training.
model = prepare_model_for_int8_training(model)

config = LoraConfig(
    r=LORA_R,
    lora_alpha=LORA_ALPHA,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=LORA_DROPOUT,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
tokenizer.pad_token_id = 0  # unk. we want this to be different from the eos token
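The uppercase constants referenced here and in the training call below (CUTOFF_LEN, LORA_R, MICRO_BATCH_SIZE, and so on) are defined elsewhere in the script. The values below are a sketch based on the usual alpaca-lora defaults, not necessarily the exact values used in this repo.

# Assumed alpaca-lora-style hyperparameters; the repo may use different values.
MICRO_BATCH_SIZE = 4                 # per-device batch size
BATCH_SIZE = 128                     # effective batch size
GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
EPOCHS = 2
LEARNING_RATE = 3e-4
CUTOFF_LEN = 256                     # max tokens per training example
LORA_R = 8                           # LoRA rank
LORA_ALPHA = 16
LORA_DROPOUT = 0.05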
Fine-tuning
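The data variable below already holds the instruction dataset; the loading step is not part of this excerpt. A minimal sketch with Hugging Face datasets might look like the following (the file path reuses the one from the inference section and is an assumption for the training run):

from datasets import load_dataset

# Assumed loading step: yields a DatasetDict with a "train" split, matching data["train"] below.
data = load_dataset("json", data_files="data/zh-data01.json")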
data = data.shuffle().map(lambda x: tokenize(generate_prompt(x)))

trainer = transformers.Trainer(
    model=model,
    train_dataset=data["train"],
    args=transformers.TrainingArguments(
        per_device_train_batch_size=MICRO_BATCH_SIZE,
        gradient_accumulation_steps=GRADIENT_ACCUMULATION_STEPS,
        warmup_steps=100,
        num_train_epochs=EPOCHS,
        learning_rate=LEARNING_RATE,
        fp16=True,
        logging_steps=20,
        output_dir="qys-alpaca-chinese",
        save_total_limit=3,
    ),
    data_collator=transformers.DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
model.config.use_cache = False
trainer.train(resume_from_checkpoint=False)
# trainer.train()
model.save_pretrained("qys-alpaca-chinese")
Inference
- For ChatGLM: infer.py
- For LLaMA: generate_llama1.py
Inference code:
import json
import torch
from transformers import LlamaForCausalLM, LlamaTokenizer, GenerationConfig
from peft import PeftModel

tokenizer = LlamaTokenizer.from_pretrained("decapoda-research/llama-7b-hf")
model = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-7b-hf",
    load_in_8bit=True,
    torch_dtype=torch.float16,
    device_map="auto",
)
# Load the LoRA adapter produced by the fine-tuning step on top of the base model.
model = PeftModel.from_pretrained(
    model, "./qys-alpaca-chinese", torch_dtype=torch.float16
)


def generate_prompt(instruction, input=None):
    if input:
        return f"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{instruction}
### Input:
{input}
### Response:"""
    else:
        return f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
{instruction}
### Response:"""


instructions = json.load(open("data/zh-data01.json"))

answers = []
with torch.no_grad():
    for idx, item in enumerate(instructions[12:18]):
        # format_example (defined elsewhere in the repo) turns a record into a prompt "context".
        feature = format_example(item)
        input_text = feature['context']
        print(input_text)
        inputs = tokenizer(input_text, return_tensors="pt")
        input_ids = inputs["input_ids"].cuda()
        generation_config = GenerationConfig(
            temperature=0.1,
            top_p=0.75,
            top_k=40,
            num_beams=4,
        )
        generation_output = model.generate(
            input_ids=input_ids,
            generation_config=generation_config,
            return_dict_in_generate=True,
            output_scores=True,
            max_new_tokens=256,
        )
        s = generation_output.sequences[0]
        output = tokenizer.decode(s)
        print(output.strip())
        print("--------------------------------------------")
That concludes this walkthrough of fine-tuning ChatGLM and LLaMA on Chinese instruction data.