CLI tool for running text through the OpenAI Text to speech API and speaking or saving the result
See ospeak: a CLI tool for speaking text in the terminal via OpenAI for background on this project.
Install this tool using pipx:
pipx install ospeakNote, on MacOS there's a broken dependency with Python 3.12 so instead run:
pipx install --python /path/to/python3.11 ospeakThis tool also depends on ffmpeg. You can install that on macOS using Homebrew like this:
brew install ffmpegNote that ospeak has dependencies that are currently incompatible with LLM (see this issue) so the two tools need to be installed in different virtual environments, hence the recommendation to use pipx.
To get your computer to say something, run:
ospeak "Hello there"You will need an OpenAI API key. You can set that as an environment variable:
export OPENAI_API_KEY="..."Or you can pass it using --token:
ospeak --token "..." "Hello there"You can pipe content into the tool:
echo "Hello there" | ospeakUse -v/--voice VOICE to select a voice. The default is alloy. The other options are:
echofableonyxnovashimmerYou can pass -v all to hear all of the voices, each with the name of the voice spoken first.
ospeak "This is my voice" -v allThe -m/--model option can be used to select a different model. The default is tts-1. Use -m tts-1-hd for a higher quality model that takes slightly longer to generate audio:
ospeak "This is higher quality" -m tts-1-hdTo adjust the speed, pass -x/--speed with a value between 0.25 and 4 (the default is 1.0):
ospeak "This is my fast voice" -x 2To write the audio to a file, pass -o/--output with a filename ending in either .mp3 or .wav:
ospeak "This is my voice" -o voice.mp3This will not speak out loud. If you want it to also speak, add the -s/--speak option:
ospeak "This is my voice" -o voice.mp3 -sUsage: ospeak [OPTIONS] [TEXT]
CLI tool for running text through OpenAI Text to speech
Set the OPENAI_API_KEY environment variable to your OpenAI API key to avoid
using the --token option every time.
Example usage:
ospeak "Everyone deserves a pelican" --voice alloy -x 1.5
Options:
--version Show the version and exit.
-v, --voice [alloy|echo|fable|onyx|nova|shimmer|all]
Voice to use
-m, --model TEXT Model to use - defaults to tts-1
-o, --output FILE Save audio to this file on disk
-x, --speed FLOAT RANGE Speed of the voice [0.25<=x<=4.0]
-s, --speak Speak the text even when saving to a file
--token TEXT OpenAI API key
--help Show this message and exit.
To contribute to this tool, first checkout the code. Then create a new virtual environment:
cd ospeak
python -m venv venv
source venv/bin/activateNow install the dependencies and test dependencies:
pip install -e '.[test]'To run the tests:
pytest