Reachy Mini goes fully local

After building your Reachy Mini, you’ll install the conversation app and start talking to it. Until now, that meant sending your audio to a remote server. Not anymore: today we’ll walk you through running the whole stack locally.

This stack is powered by speech-to-speech, our cascaded VAD → STT → LLM → TTS pipeline that exposes a Realtime API-compatible /v1/realtime WebSocket. Once you launch the backend, point the robot at it from the UI.
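To make the endpoint concrete, here is a minimal sketch of what a client might prepare before connecting. The host and port are assumptions (use whatever your backend prints at launch), and the `input_audio_buffer.append` event shape is borrowed from the OpenAI Realtime API on the assumption that a compatible server accepts it. The conversation app handles all of this for you, so treat it as illustration only.

```python
import base64
import json

# Assumption: the backend listens on localhost:8000. Adjust to your launch settings.
HOST = "localhost"
PORT = 8000


def realtime_url(host: str = HOST, port: int = PORT) -> str:
    """Build the WebSocket URL for the /v1/realtime endpoint."""
    return f"ws://{host}:{port}/v1/realtime"


def audio_append_event(pcm_bytes: bytes) -> str:
    """Wrap a raw audio chunk in a Realtime API-style client event.

    The event name and base64 'audio' field follow the OpenAI Realtime API;
    whether a given backend accepts every such event is an assumption.
    """
    return json.dumps(
        {
            "type": "input_audio_buffer.append",
            "audio": base64.b64encode(pcm_bytes).decode("ascii"),
        }
    )
```

A WebSocket client of your choice would then open `realtime_url()` and send the strings returned by `audio_append_event` as text frames.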

Cascades are the most flexible option in the open-source landscape today, and with the right pieces they’re also the fastest. We’ll recommend the components we like best, but the whole point of a cascade is that you can swap them. New models drop every week.
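The swappability argument can be sketched in a few lines. This is not the speech-to-speech library's API: `Cascade` and the stub stages below are hypothetical names, written only to show that each stage is an independent function you can replace without touching the others.

```python
from dataclasses import dataclass, replace
from typing import Callable, Optional


@dataclass(frozen=True)
class Cascade:
    """Hypothetical four-stage pipeline: VAD -> STT -> LLM -> TTS."""

    vad: Callable[[bytes], Optional[bytes]]  # returns speech audio, or None if silent
    stt: Callable[[bytes], str]              # speech audio -> transcript
    llm: Callable[[str], str]                # transcript -> reply text
    tts: Callable[[str], bytes]              # reply text -> synthesized audio

    def respond(self, audio: bytes) -> Optional[bytes]:
        speech = self.vad(audio)
        if speech is None:
            return None  # nothing was said, so skip the rest of the cascade
        return self.tts(self.llm(self.stt(speech)))


# Stub stages standing in for real models (illustrative only).
def stub_vad(audio: bytes) -> Optional[bytes]:
    return audio if audio else None


def stub_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def stub_llm(text: str) -> str:
    return text.upper()


def stub_tts(text: str) -> bytes:
    return text.encode("utf-8")


cascade = Cascade(vad=stub_vad, stt=stub_stt, llm=stub_llm, tts=stub_tts)

# Swapping one component leaves the other three untouched.
swapped = replace(cascade, llm=lambda text: text[::-1])
```

In a real deployment each stub would wrap an actual model, but the shape stays the same: replacing the LLM or the TTS voice is a one-line change.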

TL;DR

Deploy a local speech backend for your Reachy Mini.

It runs on our speech-to-speech library, which takes a cascaded approach (VAD → STT → LLM → TTS).
