Applying for and Calling the Microsoft Text-to-Speech (TTS) Service

Posted 2 years ago · Author: Zero_to_zero1234 · Category: Toy Blog

This article describes how to apply for and call the Microsoft text-to-speech (TTS) service. Corrections are welcome if anything is wrong or incomplete.

1. Apply for an account:

https://azure.microsoft.com/zh-cn/free/

There is a video tutorial here; follow it to complete the sign-up process:
https://www.bilibili.com/video/BV15a4y1W7re?vd_source=bf07f28d37849885d215dc3aea189eba
Once sign-up is complete, you can create the resource here:
https://portal.azure.com/#home

Click Resource groups; the deployed service is listed there.

Click here to get the subscription_key. You also need the location, service_region (east asia in the screenshot above). Both values are used later.
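
The sample in section 2 hard-codes westus in its URLs, while the resource above is in East Asia. As a minimal sketch of how the two values could be combined, the snippet below builds the regional URLs from service_region, reusing the host pattern of that sample. The function names, the SPEECH_SERVICE_REGION variable name, and the normalization of "East Asia" to "eastasia" are assumptions made for illustration, not part of the original sample.

```python
import os


def speech_endpoints(service_region):
    """Build the regional REST URLs, following the host pattern of the sample."""
    # Assumption: the region identifier is the display name in lowercase
    # without spaces, e.g. "East Asia" -> "eastasia".
    region = service_region.strip().lower().replace(' ', '')
    return {
        'token': 'https://%s.api.cognitive.microsoft.com/sts/v1.0/issueToken' % region,
        'tts': 'https://%s.tts.speech.microsoft.com/cognitiveservices/v1' % region,
        'voices': 'https://%s.tts.speech.microsoft.com/cognitiveservices/voices/list' % region,
    }


def load_settings(environ=os.environ):
    """Read the key and region from environment variables.

    SPEECH_SERVICE_KEY is the name the sample uses; SPEECH_SERVICE_REGION is a
    hypothetical name chosen for this sketch.
    """
    key = environ.get('SPEECH_SERVICE_KEY')
    region = environ.get('SPEECH_SERVICE_REGION')
    if not key or not region:
        raise SystemExit('Set SPEECH_SERVICE_KEY and SPEECH_SERVICE_REGION first.')
    return key, speech_endpoints(region)


if __name__ == '__main__':
    for name, url in speech_endpoints('East Asia').items():
        print(name, url)
```

Keeping the region in one place means the token URL and the synthesis URL cannot drift apart, which matters because a key is only accepted by the region it was issued for.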

2. Call the service

Once the Microsoft Azure account and service are set up, you can call the service. Code:

'''
After you've set your subscription key, run this application from your working
directory with this command: python TTSSample.py
'''
import os
import time
from xml.etree import ElementTree

import requests

# This code is required for Python 2.7
try: input = raw_input
except NameError: pass

'''
If you prefer, you can hardcode your subscription key as a string and remove
the provided conditional statement. However, we do recommend using environment
variables to secure your subscription keys. The environment variable is
set to SPEECH_SERVICE_KEY in our sample.
For example:
subscription_key = "Your-Key-Goes-Here"
'''

if 'SPEECH_SERVICE_KEY' in os.environ:
    subscription_key = os.environ['SPEECH_SERVICE_KEY']
else:
    # Exit with a non-zero status so callers can tell the run failed.
    raise SystemExit('Environment variable for your subscription key is not set.')

class TextToSpeech(object):
    def __init__(self, subscription_key):
        self.subscription_key = subscription_key
        self.tts = input("What would you like to convert to speech: ")
        self.timestr = time.strftime("%Y%m%d-%H%M")
        self.access_token = None

    '''
    The TTS endpoint requires an access token. This method exchanges your
    subscription key for an access token that is valid for ten minutes.
    '''
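    def get_token(self):
        # The region in this host must match your resource's region
        # (for example 'eastasia' for East Asia); a key is rejected elsewhere.
        fetch_token_url = "https://westus.api.cognitive.microsoft.com/sts/v1.0/issueToken"
        headers = {
            'Ocp-Apim-Subscription-Key': self.subscription_key
        }
        response = requests.post(fetch_token_url, headers=headers)
        # Fail early on a bad key or region instead of storing an error body as the token.
        response.raise_for_status()
        self.access_token = str(response.text)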

    def save_audio(self):
        # Use the same region here as in the token URL (e.g. 'eastasia').
        base_url = 'https://westus.tts.speech.microsoft.com/'
        path = 'cognitiveservices/v1'
        constructed_url = base_url + path
        headers = {
            'Authorization': 'Bearer ' + self.access_token,
            'Content-Type': 'application/ssml+xml',
            'X-Microsoft-OutputFormat': 'riff-24khz-16bit-mono-pcm',
            'User-Agent': 'YOUR_RESOURCE_NAME'
        }
        xml_body = ElementTree.Element('speak', version='1.0')
        xml_body.set('{http://www.w3.org/XML/1998/namespace}lang', 'en-us')
        voice = ElementTree.SubElement(xml_body, 'voice')
        voice.set('{http://www.w3.org/XML/1998/namespace}lang', 'en-US')
        # If this voice is not offered for your resource, pick a ShortName from get_voices_list().
        voice.set('name', 'en-US-Guy24kRUS') # Short name for 'Microsoft Server Speech Text to Speech Voice (en-US, Guy24KRUS)'
        voice.text = self.tts
        body = ElementTree.tostring(xml_body)

        response = requests.post(constructed_url, headers=headers, data=body)
        '''
        If a success response is returned, then the binary audio is written
        to file in your working directory. It is prefaced by sample and
        includes the date.
        '''
        if response.status_code == 200:
            with open('sample-' + self.timestr + '.wav', 'wb') as audio:
                audio.write(response.content)
                print("\nStatus code: " + str(response.status_code) + "\nYour TTS is ready for playback.\n")
        else:
            print("\nStatus code: " + str(response.status_code) + "\nSomething went wrong. Check your subscription key and headers.\n")
            print("Reason: " + str(response.reason) + "\n")

    def get_voices_list(self):
        # Use the same region here as in the token URL (e.g. 'eastasia').
        base_url = 'https://westus.tts.speech.microsoft.com/'
        path = 'cognitiveservices/voices/list'
        constructed_url = base_url + path
        headers = {
            'Authorization': 'Bearer ' + self.access_token,
        }
        response = requests.get(constructed_url, headers=headers)
        if response.status_code == 200:
            print("\nAvailable voices: \n" + response.text)
        else:
            print("\nStatus code: " + str(response.status_code) + "\nSomething went wrong. Check your subscription key and headers.\n")

if __name__ == "__main__":
    app = TextToSpeech(subscription_key)
    app.get_token()
    app.save_audio()
    # Get a list of voices https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-text-to-speech#get-a-list-of-voices
    # app.get_voices_list()
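
To see what save_audio() actually sends, it can help to build the SSML body on its own and print it before making any request. The sketch below repeats the same ElementTree steps as the sample in a standalone function; build_ssml and its parameters are names introduced here for illustration, and the default voice is simply the one the sample uses.

```python
from xml.etree import ElementTree

# Namespace URI that ElementTree serializes with the reserved "xml:" prefix.
XML_NS = '{http://www.w3.org/XML/1998/namespace}'


def build_ssml(text, voice_name='en-US-Guy24kRUS', lang='en-US'):
    """Return the SSML request body as bytes, built the way save_audio() does."""
    speak = ElementTree.Element('speak', version='1.0')
    speak.set(XML_NS + 'lang', lang)
    voice = ElementTree.SubElement(speak, 'voice')
    voice.set(XML_NS + 'lang', lang)
    voice.set('name', voice_name)
    # ElementTree escapes &, < and > in the text when serializing.
    voice.text = text
    return ElementTree.tostring(speak)


if __name__ == '__main__':
    print(build_ssml('Tom & Jerry').decode('utf-8'))
```

Building the body with ElementTree rather than string concatenation means user input containing characters such as & or < cannot produce malformed XML, which is one common cause of a 400 response.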

References:
https://docs.microsoft.com/zh-cn/azure/cognitive-services/speech-service/
https://github.com/Azure-Samples/Cognitive-Speech-TTS/blob/28681c8292c95aebb36d3696b8822b4cd17c3c45/Samples-Http/OLD/Python/TTSSample.py
