国产无码综合区,色欲AV无码国产永久播放,无码天堂亚洲国产AV,国产日韩欧美女同一区二区

AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic

2年前作者：guwuyue分類(lèi)：Toy博客閱讀(55)違法舉報(bào)

這篇具有很好參考價(jià)值的文章主要介紹了AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic。希望對(duì)大家有所幫助。如果存在錯(cuò)誤或未考慮完全的地方，請(qǐng)大家不吝賜教，您也可以點(diǎn)擊"舉報(bào)違法"按鈕提交疑問(wèn)。

?環(huán)境安裝

黑白照片上色

文生圖-Stable Diffusion

?文生圖-Dreambooth

圖生圖-ControlNet-Canny

圖生圖-ControlNet-Pose

圖生圖-ControlNet Animation

訓(xùn)練自己的ControlNet

AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic

?環(huán)境安裝

mim install mmagic

pip install opencv-python pillow matplotlib seaborn tqdm -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install clip transformers gradio 'httpx[socks]' diffusers==0.14.0 -i https://pypi.tuna.tsinghua.edu.cn/simple
mim install 'mmdet>=3.0.0'

# 檢查 Pytorch
import torch, torchvision
print('Pytorch 版本', torch.__version__)
print('CUDA 是否可用',torch.cuda.is_available())

# 檢查 mmcv
import mmcv
from mmcv.ops import get_compiling_cuda_version, get_compiler_version
print('MMCV版本', mmcv.__version__)
print('CUDA版本', get_compiling_cuda_version())
print('編譯器版本', get_compiler_version())

# 檢查 mmagic
import mmagic
print('MMagic版本', mmagic.__version__)

黑白照片上色

? ? ? ? 下載樣例圖?
?

python demo/mmagic_inference_demo.py --model-name inst_colorization --img data/test_colorization.jpg --result-out-dir outpusts/out_colorization.png

樣例效果： AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic 測(cè)試結(jié)果：

文生圖-Stable Diffusion

from mmagic.apis import MMagicInferencer
# 載入 Stable Diffusion 模型
sd_inferencer = MMagicInferencer(model_name='stable_diffusion')
# 指定Prompt文本
text_prompts = 'A panda is having dinner at KFC'
text_prompts = 'A Persian cat walking in the streets of New York'
# 執(zhí)行預(yù)測(cè)
sd_inferencer.infer(text=text_prompts, result_out_dir='outputs/sd_res.png')

?測(cè)試效果： AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic

?文生圖-Dreambooth

在數(shù)據(jù)集上訓(xùn)練Dreambooth, 數(shù)據(jù)集下載鏈接

python .\tools\train.py .\configs\dreambooth\dreambooth-lora.py

用訓(xùn)練好的模型做預(yù)測(cè)

import torch
from mmengine import Config
from mmagic.registry import MODELS
from mmagic.utils import register_all_modules

register_all_modules()

cfg = Config.fromfile('configs/dreambooth/dreambooth-lora.py')
dreambooth_lora = MODELS.build(cfg.model)
state = torch.load('work_dirs/dreambooth-lora/iter_1000.pth')['state_dict']

def convert_state_dict(state):
    state_dict_new = {}
    for k, v in state.items():
        if '.module' in k:
            k_new = k.replace('.module', '')
        else:
            k_new = k
        if 'vae' in k:
            if 'to_q' in k:
                k_new = k.replace('to_q', 'query')
            elif 'to_k' in k:
                k_new = k.replace('to_k', 'key')
            elif 'to_v' in k:
                k_new = k.replace('to_v', 'value')
            elif 'to_out' in k:
                k_new = k.replace('to_out.0', 'proj_attn')
        state_dict_new[k_new] = v
    return state_dict_new

dreambooth_lora.load_state_dict(convert_state_dict(state))
dreambooth_lora = dreambooth_lora.cuda()
samples = dreambooth_lora.infer('side view of sks dog', guidance_scale=5)
samples = dreambooth_lora.infer('ear close-up of sks dog', guidance_scale=5)

圖生圖-ControlNet-Canny

import cv2
import numpy as np
import mmcv
from mmengine import Config
from PIL import Image
from mmagic.registry import MODELS
from mmagic.utils import register_all_modules

register_all_modules()
#載入ControNet模型
cfg = Config.fromfile('configs/controlnet/controlnet-canny.py')
controlnet = MODELS.build(cfg.model).cuda()
#輸入Canny邊緣圖
control_url = 'https://user-images.githubusercontent.com/28132635/230288866-99603172-04cb-47b3-8adb-d1aa532d1d2c.jpg'
control_img = mmcv.imread(control_url)
control = cv2.Canny(control_img, 100, 200)
control = control[:, :, None]
control = np.concatenate([control] * 3, axis=2)
control = Image.fromarray(control)
#咒語(yǔ)Prompt
prompt = 'Room with blue walls and a yellow ceiling.'
#執(zhí)行預(yù)測(cè)
output_dict = controlnet.infer(prompt, control=control)
samples = output_dict['samples']
for idx, sample in enumerate(samples):
    sample.save(f'sample_{idx}.png')
controls = output_dict['controls']
for idx, control in enumerate(controls):
    control.save(f'control_{idx}.png')

圖生圖-ControlNet-Pose

import mmcv
from mmengine import Config
from PIL import Image

from mmagic.registry import MODELS
from mmagic.utils import register_all_modules

register_all_modules()

# 載入ControlNet模型
cfg = Config.fromfile('configs/controlnet/controlnet-pose.py')
# convert ControlNet's weight from SD-v1.5 to Counterfeit-v2.5
cfg.model.unet.from_pretrained = 'gsdf/Counterfeit-V2.5'
cfg.model.vae.from_pretrained = 'gsdf/Counterfeit-V2.5'
cfg.model.init_cfg['type'] = 'convert_from_unet'
controlnet = MODELS.build(cfg.model).cuda()
# call init_weights manually to convert weight
controlnet.init_weights()

# 咒語(yǔ)Prompt
prompt = 'masterpiece, best quality, sky, black hair, skirt, sailor collar, looking at viewer, short hair, building, bangs, neckerchief, long sleeves, cloudy sky, power lines, shirt, cityscape, pleated skirt, scenery, blunt bangs, city, night, black sailor collar, closed mouth'

# 輸入Pose圖
control_url = 'https://user-images.githubusercontent.com/28132635/230380893-2eae68af-d610-4f7f-aa68-c2f22c2abf7e.png'
control_img = mmcv.imread(control_url)
control = Image.fromarray(control_img)
control.save('control.png')

# 執(zhí)行預(yù)測(cè)
output_dict = controlnet.infer(prompt, control=control, width=512, height=512, guidance_scale=7.5)
samples = output_dict['samples']
for idx, sample in enumerate(samples):
    sample.save(f'sample_{idx}.png')
controls = output_dict['controls']
for idx, control in enumerate(controls):
    control.save(f'control_{idx}.png')

圖生圖-ControlNet Animation

方式一:Gradio命令行

python .\demo\gradio_controlnet_animation.py

方式二：MMagic API?

# 導(dǎo)入工具包
from mmagic.apis import MMagicInferencer

# Create a MMEdit instance and infer
editor = MMagicInferencer(model_name='controlnet_animation')

# 指定 prompt 咒語(yǔ)
prompt = 'a girl, black hair, T-shirt, smoking, best quality, extremely detailed'
negative_prompt = 'longbody, lowres, bad anatomy, bad hands, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality'

# 待測(cè)視頻
# https://user-images.githubusercontent.com/12782558/227418400-80ad9123-7f8e-4c1a-8e19-0892ebad2a4f.mp4
video = '../run_forrest_frames_rename_resized.mp4'
save_path = '../output_video.mp4'

# 執(zhí)行預(yù)測(cè)
editor.infer(video=video, prompt=prompt, image_width=512, image_height=512, negative_prompt=negative_prompt, save_path=save_path)

訓(xùn)練自己的ControlNet

????????下載數(shù)據(jù)集文章來(lái)源地址http://www.zghlxwxcb.cn/news/detail-513274.html

python .\tools\train.py .\configs\controlnet\controlnet-1xb1-fill50k.py

到了這里，關(guān)于AI實(shí)戰(zhàn)營(yíng)：生成模型+底層視覺(jué)+AIGC多模態(tài) 算法庫(kù)MMagic的文章就介紹完了。如果您還想了解更多內(nèi)容，請(qǐng)?jiān)谟疑辖撬阉鱐OY模板網(wǎng)以前的文章或繼續(xù)瀏覽下面的相關(guān)文章，希望大家以后多多支持TOY模板網(wǎng)！

本文來(lái)自互聯(lián)網(wǎng)用戶(hù)投稿，該文觀點(diǎn)僅代表作者本人，不代表本站立場(chǎng)。本站僅提供信息存儲(chǔ)空間服務(wù)，不擁有所有權(quán)，不承擔(dān)相關(guān)法律責(zé)任。如若轉(zhuǎn)載，請(qǐng)注明出處：如若內(nèi)容造成侵權(quán)/違法違規(guī)/事實(shí)不符，請(qǐng)點(diǎn)擊違法舉報(bào)進(jìn)行投訴反饋，一經(jīng)查實(shí)，立即刪除！

分享到：

領(lǐng)支付寶紅包贊助服務(wù)器費(fèi)用

AI大模型日?qǐng)?bào)#04-08：多模態(tài)醫(yī)療視覺(jué)、復(fù)現(xiàn)OpenAI RLHF、Mistral Large引入Amazon
導(dǎo)讀：?歡迎閱讀《AI大模型日?qǐng)?bào)》，內(nèi)容基于Python爬蟲(chóng)和LLM自動(dòng)生成。目前采用“文心一言”生成了每條資訊的摘要。標(biāo)題: 超10秒高分辨率，北大Open Sora視頻生成更強(qiáng)了，還支持華為芯片 ? 摘要:? 北大團(tuán)隊(duì)與兔展聯(lián)合發(fā)起的Open Sora Plan旨在通過(guò)開(kāi)源社區(qū)復(fù)現(xiàn)OpenAI的Sora視頻
2024年04月17日
瀏覽(32)
4.AI人工智能大模型匯總：類(lèi)GPT系列模型、模型中轉(zhuǎn)站Auto-GPT、多模態(tài)大模型、視覺(jué)模型、自然語(yǔ)言模型
模型名稱(chēng) 發(fā)布方類(lèi)型開(kāi)源類(lèi)型原始模型框架 paddle版本模型能力模型語(yǔ)言模型參數(shù) 簡(jiǎn)介模型鏈接體驗(yàn)鏈接 paddle版本鏈接項(xiàng)目鏈接備注發(fā)布日期創(chuàng)建人模型星火認(rèn)知大模型科大訊飛語(yǔ)言模型未發(fā)布暫無(wú)paddle 文生文中文未知 https://xinghuo.xfyun.cn/?ch=bdtg-xh-cy01bd_vid=1
2024年02月04日
瀏覽(39)
AIGC實(shí)戰(zhàn)——生成模型簡(jiǎn)介
生成式人工智能 ( Generative Artificial Intelligence , GAI ) 是一種人工智能方法，旨在通過(guò)學(xué)習(xí)訓(xùn)練數(shù)據(jù)的分布模型來(lái)生成新的、原創(chuàng)的數(shù)據(jù)。人工智能生成內(nèi)容 ( Artificial Intelligence Generated Content , AIGC ) 是生成式人工智能的一個(gè)具體應(yīng)用和實(shí)現(xiàn)方式，是指利用人工智能技術(shù)生成各種形
2024年02月08日
瀏覽(15)
AI實(shí)戰(zhàn)營(yíng)：MMPose開(kāi)源算法庫(kù)
目錄 RTMPose關(guān)鍵點(diǎn)檢測(cè)全流程 MMPose官方可視化工具visualizer 代碼實(shí)現(xiàn)： MMPose預(yù)訓(xùn)練模型預(yù)測(cè)-命令行預(yù)測(cè)單張圖 Loads checkpoint by http backend from path: https://download.openmmlab.com/mmdetection/v2.0/faster_rcnn/faster_rcnn_r50_fpn_1x_coco/faster_rcnn_r50_fpn_1x_coco_20200130-047c8118.pth Loads checkpoint by http backen
2024年02月07日
瀏覽(16)
AI之LLM/MLM：Nvidia官網(wǎng)人工智能大模型工具合集(大語(yǔ)言模型/多模態(tài)模型，文本生成/圖像生成/視頻生成)的簡(jiǎn)介、使用方法、案例應(yīng)用之詳細(xì)攻略
AI之LLM/MLM：Nvidia官網(wǎng)人工智能大模型工具合集(大語(yǔ)言模型/多模態(tài)模型，文本生成/圖像生成/視頻生成)的簡(jiǎn)介、使用方法、案例應(yīng)用之詳細(xì)攻略目錄 Nvidia官網(wǎng)人工智能大模型工具合集的簡(jiǎn)介 1、網(wǎng)站主要功能包括: Nvidia官網(wǎng)人工智能大模型工具合集的使用方法 1、SDXL-Turbo的使
2024年04月28日
瀏覽(45)
AIGC技術(shù)研究與應(yīng)用 ---- 下一代人工智能：新范式！新生產(chǎn)力?。?.3-大模型發(fā)展歷程之圖像、視頻生成與視覺(jué)大模型）
2024年02月09日
瀏覽(96)
字節(jié)技術(shù)大牛跑步進(jìn)入AIGC創(chuàng)業(yè)，聚焦視覺(jué)領(lǐng)域，搭建算法平臺(tái)，還是多模態(tài)的那種...
衡宇發(fā)自凹非寺量子位 | 公眾號(hào) QbitAI 3月最后一天，王長(zhǎng)虎在龍湖集團(tuán)的last day。這位字節(jié)跳動(dòng)前視覺(jué)技術(shù)負(fù)責(zé)人、AI Lab總監(jiān)辭職掛印，火速啟程下一站： AIGC創(chuàng)業(yè)，成立新公司愛(ài)詩(shī)科技。他拉團(tuán)隊(duì)自起爐灶，要打造一個(gè) 聚焦AIGC的視覺(jué)多模態(tài)算法平臺(tái) ?，覆蓋視覺(jué)相關(guān)
2024年02月09日
瀏覽(19)
AIGC內(nèi)容分享(二十)：「AI視頻生成」技術(shù)核心基礎(chǔ)知識(shí)和模型應(yīng)用
目錄何為AI視頻？一、技術(shù)發(fā)展概況二、代表模型及應(yīng)用??????? 三、仍存在許多技術(shù)難點(diǎn) 「 AI 視頻」通常指的是由人工智能（AI）技術(shù)生成或處理的視頻。這可能包括使用深度學(xué)習(xí)、計(jì)算機(jī)視覺(jué)和其他相關(guān)技術(shù)來(lái)改善視頻的質(zhì)量、內(nèi)容或生成全新的視頻內(nèi)容。一
2024年01月18日
瀏覽(25)
【AIGC】百度：跨模態(tài)內(nèi)容生成技術(shù)與應(yīng)用
內(nèi)容來(lái)源：機(jī)器之心，百度文心一格總架構(gòu)師肖欣延博士，《跨模態(tài)內(nèi)容生成與技術(shù)與應(yīng)用》的演講。從圖像生成角度來(lái)看，下圖左邊是 2020 年圖像生的水平，是很有代表性的一個(gè)拍賣(mài)畫(huà)作。到了 2022 年，技術(shù)已經(jīng)相比之前強(qiáng)了很多。我們?nèi)我庹f(shuō)一句話就能生成一張非常精致
2024年02月09日
瀏覽(25)
AI圖像（AIGC for PIC）大模型實(shí)戰(zhàn)|Stable Diffusion
AI GC text to pic 圖像生成模型 ?目前隨著AIGC模型的火爆，AI內(nèi)容創(chuàng)作遠(yuǎn)超人類(lèi)創(chuàng)造水平和能力，極大了提升了創(chuàng)作空間。為此我們要接觸新鮮事物，用于嘗試新技術(shù)。那針對(duì)目前火爆的AImodel我們開(kāi)始進(jìn)行學(xué)習(xí)，嘗試本地化部署，生成自己的模型。先感性的認(rèn)識(shí)下模型的基礎(chǔ)知
2023年04月24日
瀏覽(25)