Take the words right out of my mouth with English
Chatterbox is an AI-powered voice tool that helps non-English-speaking parents create an English-language environment for their children.
When your child speaks (in Chinese or English), Chatterbox:
- Listens and transcribes their speech
- Translates and responds in natural, child-friendly English
- Speaks the English response aloud

Requirements:
- Python 3.10+
- A microphone
- A DeepSeek API key

```bash
git clone https://github.com/raphael-alex/chatterbox.git
cd chatterbox
pip install -r requirements.txt
```

```bash
# Set your API key
export DEEPSEEK_API_KEY="sk-..."
# Or copy and edit config.yaml
cp config.yaml my_config.yaml
# Edit my_config.yaml with your settings
```

```bash
python main.py
```

Press Ctrl+C to exit.

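
How the key is looked up is not spelled out here. As a minimal sketch, assuming the `DEEPSEEK_API_KEY` environment variable takes precedence over a key stored in the config file, the lookup might resemble the following. The function name `resolve_api_key` and the `llm.api_key` config field are hypothetical and used only for illustration.

```python
import os
from collections.abc import Mapping
from typing import Optional


def resolve_api_key(
    config: Mapping[str, object],
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Return the API key, preferring the environment over the config file.

    Assumption: the config may carry a hypothetical ``llm.api_key`` entry.
    """
    env = os.environ if env is None else env
    key = env.get("DEEPSEEK_API_KEY", "").strip()
    if key:
        return key

    # Fall back to the (assumed) llm.api_key field of the parsed config.
    llm_section = config.get("llm")
    if isinstance(llm_section, Mapping):
        key = str(llm_section.get("api_key", "")).strip()
    if not key:
        raise RuntimeError(
            "No API key found: set DEEPSEEK_API_KEY or add one to the config"
        )
    return key
```

Passing `env` explicitly keeps the function easy to test; in normal use it falls back to `os.environ`.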
Edit config.yaml to customize:
| Setting | Options | Default |
|---|---|---|
| ASR engine | whisper-api, whisper-local | whisper-api |
| LLM engine | openai, deepseek | openai |
| TTS engine | edge-tts | edge-tts |
| TTS voice | Edge-TTS voice names | en-US-JennyNeural |
| VAD silence duration | seconds | 1.5 |
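
The VAD silence duration presumably controls how long the speaker must stay quiet before an utterance is treated as finished. The sketch below illustrates that idea with a simple energy threshold. It is not Chatterbox's actual detector: the class name, the frame length, and the threshold are all assumptions.

```python
import math
from dataclasses import dataclass


@dataclass
class SilenceEndpointer:
    """Flags the end of an utterance after enough consecutive silent frames."""

    silence_duration: float = 1.5   # seconds, matching the table's default
    frame_ms: int = 30              # length of one audio frame (assumed)
    energy_threshold: float = 0.01  # mean-square level counted as speech (assumed)

    def __post_init__(self) -> None:
        # Number of consecutive silent frames that ends an utterance.
        self._frames_needed = math.ceil(self.silence_duration * 1000 / self.frame_ms)
        self._silent_frames = 0
        self._heard_speech = False

    def feed(self, frame: list[float]) -> bool:
        """Consume one frame of samples in [-1, 1]; return True when the utterance ends."""
        energy = sum(s * s for s in frame) / max(len(frame), 1)
        if energy >= self.energy_threshold:
            self._heard_speech = True
            self._silent_frames = 0
            return False
        if not self._heard_speech:
            return False  # leading silence: there is no utterance to end yet
        self._silent_frames += 1
        if self._silent_frames >= self._frames_needed:
            # Reset so the next utterance starts from a clean state.
            self._heard_speech = False
            self._silent_frames = 0
            return True
        return False
```

With the defaults, 50 silent 30 ms frames in a row (1.5 seconds) after speech mark the end of an utterance. A shorter duration makes the tool respond faster but risks cutting off a child who pauses mid-sentence.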

```yaml
asr:
  engine: whisper-local
  whisper_local:
    model_size: base  # tiny, base, small, medium, large
    device: auto
```

Microphone → VAD → ASR → LLM → TTS → Speaker
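
The pipeline can be read as a chain of stages, each consuming the previous stage's output. The sketch below models one conversational turn with plain callables. The function `run_turn` and the stand-in stages are hypothetical; they replace the real engines so the example runs without a microphone or API key.

```python
from typing import Callable


def run_turn(
    audio: bytes,
    transcribe: Callable[[bytes], str],  # ASR: audio -> text
    respond: Callable[[str], str],       # LLM: child's words -> English reply
    synthesize: Callable[[str], bytes],  # TTS: reply text -> audio
) -> bytes:
    """Process one utterance that the VAD stage has already segmented."""
    text = transcribe(audio)
    if not text.strip():
        return b""  # nothing recognised, so stay silent
    reply = respond(text)
    return synthesize(reply)


# Stand-in stages: they only move text around, no speech models involved.
def fake_asr(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


if __name__ == "__main__":
    print(run_turn(b"I want water", fake_asr, fake_llm, fake_tts))
```

Because each stage is only a function of its input, any of them can be replaced independently.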
All components use an adapter pattern — swap engines via config without code changes.
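
One way such config-driven swapping could work is a shared interface plus a name-to-class registry. The sketch below is illustrative only: the class names, the registry, and `build_asr` are hypothetical, and it assumes engine options live under `asr.<engine name with underscores>` in the config. The engine names are taken from the settings table.

```python
from abc import ABC, abstractmethod


class ASREngine(ABC):
    """Common interface every speech-recognition adapter implements."""

    @abstractmethod
    def transcribe(self, audio: bytes) -> str: ...


class WhisperAPIEngine(ASREngine):
    def transcribe(self, audio: bytes) -> str:
        return "<text from a hosted Whisper call>"  # placeholder, no real request


class WhisperLocalEngine(ASREngine):
    def __init__(self, model_size: str = "base", device: str = "auto") -> None:
        self.model_size = model_size
        self.device = device

    def transcribe(self, audio: bytes) -> str:
        return f"<text from local Whisper ({self.model_size})>"  # placeholder


# Registry mapping config values to adapter classes.
ASR_ENGINES: dict[str, type[ASREngine]] = {
    "whisper-api": WhisperAPIEngine,
    "whisper-local": WhisperLocalEngine,
}


def build_asr(config: dict) -> ASREngine:
    """Instantiate the ASR adapter named in the parsed config."""
    asr_cfg = config.get("asr", {})
    name = asr_cfg.get("engine", "whisper-api")
    try:
        engine_cls = ASR_ENGINES[name]
    except KeyError:
        raise ValueError(f"Unknown ASR engine: {name!r}") from None
    # Assumed layout: options for "whisper-local" sit under "whisper_local".
    options = asr_cfg.get(name.replace("-", "_"), {})
    return engine_cls(**options)
```

The rest of the program only calls `transcribe`, so adding an engine means writing one class and registering it. The same shape would apply to the LLM and TTS stages.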

License: MIT