Gemini 3.1: Real-World Voice Recognition with Flash Live: Making Your LINE Bot Understand You
python
dev.to
Background Google released Gemini 3.1 Flash Live at the end of March 2026 March, focusing on "making audio AI more natural and reliable." This model is specifically designed for real-time two-way voice conversations, with low latency, interruptibility, and multi-language support. I happened to have a LINE Bot project (linebot-helper-python) on hand, which already handles text, images, URLs, PDFs, and YouTube, but completely ignores voice messages: User sends a voice message Bot: