GTTS下載GTTS源代碼下載

GTTS

Ai源碼

v0.0.8

下載

雙子座文本到語音

使用Google AI（Gemini）將書面內容轉換為語音，以進行文本生成和基於Internet的信息檢索。

❓如何工作

該項目基於test/app.ts中的示例。它執行以下步驟：

獲取聲音輸入
向Google Gemini API發送請求以接收AI生成的響應
使用文本到語音（TTS）技術自動轉換對語音的響應
播放生成的音頻

？項目註釋

該項目已在Linux（Ubuntu 24.04 LTS X86_64）上進行了測試。 Windows用戶可以通過SourceForge安裝Sox。 MacOS特定的信息當前不可用。

任務	優先事項	地位
實施雙子座聊天	高的	✅完成
發展語音識別	高的	✅完成
實施音頻語言檢測	高的	✅完成
實施文本語言檢測	中等的	✅完成
實施音頻播放器	低的	✅完成
定義枚舉	低的	✅完成
集成調試	低的	✅完成

？項目安裝

在使用此存儲庫之前，請確保您的系統上安裝以下依賴關係：

Linux

Sox ： sudo apt-get install sox
libsox-fmt-all ： sudo apt-get install libsox-fmt-all
ffmpeg ： sudo apt install ffmpeg

視窗

Sox ：從SourceForge下載
FFMPEG ： choco install ffmpeg （使用巧克力）或從官方網站下載

macos

目前尚不可用MACOS特定的安裝說明。

要安裝軟件包，請根據您首選的軟件包管理器使用以下命令之一：

 # npm
$ npm install git+https://github.com/Stawa/GTTS.git --legacy-peer-deps
# Bun
$ bun install git+https://github.com/Stawa/GTTS.git --trust

？項目示例

在研究示例之前，請確保您擁有以下API鍵和憑據：

Google Gemini API密鑰（ lib.GoogleGemini ）
- 從Google Cloud Console獲取
tiktok sessionid （ lib.TextToSpeech ）
- 登錄後提取Tiktok瀏覽器餅乾
Google語音API密鑰（ lib.VoiceRecognition.fetchTranscriptGoogle ）
- 從Google Cloud Console憑據生成
Deepgram API鍵（ lib.VoiceRecognition.fetchTranscriptDeepgram ）
- 創建一個帳戶並從Deepgram Console獲取
Edenai API鍵（ lib.SummarizeText ）
- 註冊並從Edenai儀表板上取回

確保將這些API鍵安全地存儲，並且永遠不要將其投入版本控制。考慮使用環境變量或安全的密鑰管理系統。

這是一個簡潔的示例，演示瞭如何使用Google Gemini API生成響應：

 import { GoogleGemini } from "@stawa/gtts" ;
import dotenv from "dotenv" ;
dotenv . config ( ) ;

const gemini = new GoogleGemini ( {
  apiKey : process . env . GEMINI_API_KEY ,
  model : "gemini-1.5-flash" ,
  enableLogging : true ,
} ) ;

async function main ( ) {
  try {
    const question = "When was Facebook launched?" ;
    console . log ( `Question: ${ question } ` ) ;

    const response = await gemini . chat ( question ) ;
    console . log ( `Gemini's response: ${ response } ` ) ;
  } catch ( error ) {
    console . error ( "An error occurred:" , error ) ;
  }
}

main ( ) ;