Skip to main content

Google AI and LiveKit

Build world-class realtime AI apps with Google AI and LiveKit Agents.

Gemini playground

Play with the Gemini Live API in this LiveKit-powered playground

Gemini playground

Google AI ecosystem support

Google AI provides some of the most powerful AI models and services today, which integrate into LiveKit Agents in the following ways:

  • Gemini: A family of general purpose high-performance LLMs.
  • Google Cloud STT and TTS: Affordable, production-grade models for transcription and speech synthesis.
  • Gemini Live API: A speech-to-speech realtime model with live video input.

LiveKit Agents supports Google AI through the Gemini API and Vertex AI.

Getting started

Use the Voice AI quickstart to build a voice AI app with Gemini. Select an STT-LLM-TTS pipeline model type and add the following components to build on Gemini.

Voice AI quickstart

Build your first voice AI app with Google Gemini.

Install the Google plugin:

pip install "livekit-agents[google]~=1.0"

Add your Google API key to your .env. file:

GOOGLE_API_KEY=<your-google-api-key>

Use the Google LLM component to initialize your AgentSession:

from livekit.plugins import google
# ...
# in your entrypoint function
session = AgentSession(
llm=google.LLM(
model="gemini-2.0-flash",
),
# ... stt, tts,vad, turn_detection, etc.
)

LiveKit Agents overview

LiveKit Agents is an open source framework for building realtime AI apps in Python and Node.js. It supports complex voice AI workflows with multiple agents and discrete processing steps, and includes built-in load balancing.

LiveKit provides SIP support for telephony integration and full-featured frontend SDKs in multiple languages. It uses WebRTC transport for end-user devices, enabling high-quality, low-latency realtime experiences. To learn more, see LiveKit Agents.

Google plugin documentation

The following links provide more information on each available Google component in LiveKit Agents.