Interfaces

Configuration types, provider contracts, and event structures.

AnthropicLLMConfig — Configuration for the Anthropic LLM provider.
AssemblyAISTTConfig — Configuration options for the AssemblyAISTT provider.
AudioBufferQueueConfig — Configuration for an AudioBufferQueue instance.
AudioChunk — A chunk of audio data flowing through the pipeline.
AudioInputConfig — Configuration for audio input (microphone capture).
AudioInputProvider — Audio input provider interface for the ‘input’ pipeline role.
AudioMetadata — Metadata describing the format of audio data received from a provider.
AudioOutputConfig — Configuration for audio output (playback).
AudioOutputProvider — Audio output provider interface for the ‘output’ pipeline role.
BaseProvider — Base interface that all providers must implement.
BaseProviderConfig — Base configuration shared by all provider types.
BrowserAudioOutputConfig — Configuration for BrowserAudioOutput.
CartesiaOutputFormat — Cartesia output format configuration passed in each WebSocket message.
CartesiaTTSConfig — Configuration for the CartesiaTTS provider.
CompositeVoiceConfig — Main configuration type for the CompositeVoice SDK.
ConversationHistoryConfig — Configuration for multi-turn conversation history.
DeepgramFluxConfig — Configuration options for the DeepgramFlux provider.
DeepgramFluxOptions — Deepgram Flux transcription options passed to the V2 WebSocket connection.
DeepgramSTTConfig — Configuration options for the DeepgramSTT provider.
DeepgramTranscriptionOptions — Deepgram-specific transcription options passed as query parameters on the WebSocket connection URL.
DeepgramTTSConfig — Configuration for the DeepgramTTS provider.
DeepgramTTSOptions — Deepgram-specific TTS synthesis options.
EagerLLMConfig — Configuration for the eager LLM pipeline (speculative generation).
ElevenLabsSTTConfig — Configuration options for the ElevenLabs STT provider.
ElevenLabsTTSConfig — Configuration for the ElevenLabsTTS provider.
EventListenerMap — Typed mapping from event type strings to their corresponding listener signatures.
GeminiLLMConfig — Configuration for the Gemini LLM provider.
GroqLLMConfig — Configuration for the Groq LLM provider.
LLMGenerationOptions — Options for controlling a single LLM generation request.
LLMMessage — A single message in an LLM conversation.
LLMProvider — Large language model provider interface.
LLMProviderConfig — Configuration for large language model providers.
LLMToolCall — A tool invocation emitted by the LLM.
LLMToolDefinition — Definition of a tool the LLM can invoke.
LLMToolParameterSchema — Schema for a single tool parameter.
LLMToolResult — Result sent back to the LLM after executing a tool.
LoggingConfig — Configuration for SDK logging output.
MicrophoneInputConfig — Configuration for MicrophoneInput.
MistralLLMConfig — Configuration for the Mistral LLM provider.
NativeSTTConfig — Configuration options for the NativeSTT provider.
NativeTTSConfig — Configuration options for the NativeTTS provider.
OpenAICompatibleLLMConfig — Configuration for any OpenAI-compatible LLM provider.
OpenAILLMConfig — Configuration for the OpenAI LLM provider.
OpenAITTSConfig — Configuration for the OpenAITTS provider.
QueueOverflowEvent — Emitted when an AudioBufferQueue drops chunks due to overflow.
QueueStats — Statistics snapshot from an AudioBufferQueue instance.
QueueStatsEvent — Emitted when queue statistics are requested via getQueueStats().
ReconnectionConfig — Configuration for automatic WebSocket reconnection with exponential backoff.
ResolvedPipeline — A fully resolved 5-role pipeline with a provider assigned to each slot.
STTProviderConfig — Configuration for speech-to-text providers.
ToolAwareLLMProvider — LLM provider with optional tool-use support.
TranscriptionPreflightEvent — Emitted when a provider sends a preflight/eager-end-of-turn signal.
TranscriptionResult — Result of a speech-to-text transcription.
TranscriptionSpeechFinalEvent — Emitted when an utterance is fully complete.
TTSProviderConfig — Configuration for text-to-speech providers.
TurnTakingConfig — Configuration for turn-taking behavior between the user and the agent.
WebLLMLLMConfig — Configuration for the WebLLM in-browser LLM provider.
WebLLMLoadProgress — Progress information emitted during model download and WebGPU shader compilation.
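
The ReconnectionConfig entry above mentions automatic WebSocket reconnection with exponential backoff. As a rough illustration of what such a retry schedule looks like, here is a minimal TypeScript sketch. The BackoffSketchConfig interface, its field names, and both functions are hypothetical and are not taken from the SDK; the actual ReconnectionConfig options may differ.

```typescript
// Hypothetical shape, for illustration only. These field names are
// assumptions and are NOT the SDK's real ReconnectionConfig fields.
interface BackoffSketchConfig {
  initialDelayMs: number; // delay before the first retry
  maxDelayMs: number;     // upper bound on any single delay
  multiplier: number;     // growth factor applied per attempt
  maxAttempts: number;    // number of retries before giving up
}

// Delay before a given retry attempt (attempt 0 is the first retry).
// The delay grows geometrically and is capped at maxDelayMs.
function backoffDelayMs(config: BackoffSketchConfig, attempt: number): number {
  const raw = config.initialDelayMs * Math.pow(config.multiplier, attempt);
  return Math.min(raw, config.maxDelayMs);
}

// Full list of delays, one per allowed attempt.
function backoffSchedule(config: BackoffSketchConfig): number[] {
  return Array.from({ length: config.maxAttempts }, (_, attempt) =>
    backoffDelayMs(config, attempt),
  );
}

const sketch: BackoffSketchConfig = {
  initialDelayMs: 500,
  maxDelayMs: 8000,
  multiplier: 2,
  maxAttempts: 6,
};

console.log(backoffSchedule(sketch)); // [500, 1000, 2000, 4000, 8000, 8000]
```

Capping the delay keeps late retries from waiting arbitrarily long. Real implementations often add random jitter as well, which this sketch leaves out so the output stays deterministic.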