elevenlabs
1.0.0
非正式的elevenlabs.io(11.ai)语音综合客户端
该库与Elevenlabs没有任何关系。
Elevenlabs的官方API文档(该客户端)可以在此处找到。
此GO客户端提供了一个简单的接口,可以创建合成的声音并向elevenlabs.io提出tts(文本到语音)请求。
作为先决条件,您必须已经拥有elevenlabs.io的帐户。创建帐户后,您可以从此处获取API密钥。
要测试一个示例say程序,请运行:
go install github.com/taigrr/elevenlabs/cmd/say@latest
设置XI_API_KEY环境变量,然后将其输送一些文本以使其旋转!
要使用此库,请创建一个新客户端,然后将TTS请求发送到语音。以下代码块说明了如何使用流端点复制SAI/ESPEAK命令。我选择使用Faiface的蜂鸣器包,但是您也可以将文件保存到MP3上。
package main
import (
"bufio"
"context"
"io"
"log"
"os"
"time"
"github.com/faiface/beep"
"github.com/faiface/beep/mp3"
"github.com/faiface/beep/speaker"
"github.com/taigrr/elevenlabs/client"
"github.com/taigrr/elevenlabs/client/types"
)
func main () {
ctx := context . Background ()
// load in an API key to create a client
client := client . New ( os . Getenv ( "XI_API_KEY" ))
// fetch a list of voice IDs from elevenlabs
ids , err := client . GetVoiceIDs ( ctx )
if err != nil {
panic ( err )
}
// prepare a pipe for streaming audio directly to beep
pipeReader , pipeWriter := io . Pipe ()
reader := bufio . NewReader ( os . Stdin )
text , _ := reader . ReadString ( 'n' )
go func () {
// stream audio from elevenlabs using the first voice we found
err = client . TTSStream ( ctx , pipeWriter , text , ids [ 0 ], types. SynthesisOptions { Stability : 0.75 , SimilarityBoost : 0.75 , Style : 0.0 , UseSpeakerBoost : true })
if err != nil {
panic ( err )
}
pipeWriter . Close ()
}()
// decode and prepare the streaming mp3 as it comes through
streamer , format , err := mp3 . Decode ( pipeReader )
if err != nil {
log . Fatal ( err )
}
defer streamer . Close ()
speaker . Init ( format . SampleRate , format . SampleRate . N ( time . Second / 10 ))
done := make ( chan bool )
// play the audio
speaker . Play ( beep . Seq ( streamer , beep . Callback ( func () {
done <- true
})))
<- done
}下面的示例演示了如何使用Sound Generation API生成声音效果:
package main
import (
"context"
"os"
"github.com/taigrr/elevenlabs/client"
)
func main () {
ctx := context . Background ()
// Create a new client with your API key
client := client . New ( os . Getenv ( "XI_API_KEY" ))
// Generate a sound effect and save it to a file
f , err := os . Create ( "footsteps.mp3" )
if err != nil {
panic ( err )
}
defer f . Close ()
// Basic usage (using default duration and prompt influence)
err = client . SoundGenerationWriter ( ctx , f , "footsteps on wooden floor" , 0 , 0 )
if err != nil {
panic ( err )
}
// Advanced usage with custom duration and prompt influence
audio , err := client . SoundGeneration (
ctx ,
"heavy rain on a tin roof" ,
5.0 , // Set duration to 5 seconds
0.5 , // Set prompt influence to 0.5
)
if err != nil {
panic ( err )
}
os . WriteFile ( "rain.mp3" , audio , 0644 )
}