Reachy Mini goes fully local

Amir Mahla's avatar
Andres Marafioti's avatar

After building your Reachy Mini, you’ll install the conversation app and start talking to it. Until now, you had to send your audio to a server. But not anymore. Today we’ll walk you through running the whole stack locally.

This stack is powered by speech-to-speech, our cascaded VAD → STT → LLM → TTS pipeline that exposes a Realtime API-compatible /v1/realtime WebSocket. Once you launch the backend, point the robot at it from the UI.

Cascades are the most flexible option in the open-source landscape today, and with the right pieces they’re also the fastest. We’ll recommend the components we like best, but the whole point of a cascade is that you can swap them. New models drop every week.

TL;DR