I've been working on Churo, an open-source voice assistant built entirely in Python.
It features high-quality speech-to-text and text-to-speech, web search, image understanding, and agentic capabilities. It runs on Ollama models and is designed to be easy to modify and extend.
The goal is to provide a capable, local-first voice assistant that developers can actually inspect, customize, and build on.
Repository: https://github.com/MathObsession/Churo-assistant
Feedback, issues, and contributions are welcome.