.md

Kyle Phillips

Creative Technologist, Google NYC

Gemini Live API Web Console

Enabling low-latency conversational AI on the web

December, 2024

  • Launched in December 2024, I worked alongside Deepmind's Gemini team for their upcoming release of the Gemini 2.0 Flash model, part of the release was Google's first low-latency conversational LLM, the "Gemini Live API" and the quick follow-up of a new Unified SDK. As others were focused on python, I was most excited about what conversational AI meant for the web and I knew there were a few complex pieces to work through related to WebSocket management and audio processing.
  • Since the initial launch of this project it has had an outsized impact. It has been used in Projects/Gemini Robotics as well as in IO 2025 Demos in the developer keynote and AI sandbox. It is often found at the forefront of where AI finds personality.

1 Reference

  1. Projects/Gemini Robotics