Interfaces
Configuration types, provider contracts, and event structures.
- AnthropicLLMConfig — Configuration for the Anthropic LLM provider.
- AssemblyAISTTConfig — Configuration options for the AssemblyAISTT provider.
- AudioBufferQueueConfig — Configuration for an AudioBufferQueue instance.
- AudioChunk — A chunk of audio data flowing through the pipeline.
- AudioInputConfig — Configuration for audio input (microphone capture).
- AudioInputProvider — Audio input provider interface for the ‘input’ pipeline role.
- AudioMetadata — Metadata describing the format of audio data received from a provider.
- AudioOutputConfig — Configuration for audio output (playback).
- AudioOutputProvider — Audio output provider interface for the ‘output’ pipeline role.
- BaseProvider — Base interface that all providers must implement.
- BaseProviderConfig — Base configuration shared by all provider types.
- BrowserAudioOutputConfig — Configuration for BrowserAudioOutput.
- CartesiaOutputFormat — Cartesia output format configuration passed in each WebSocket message.
- CartesiaTTSConfig — Configuration for the CartesiaTTS provider.
- CompositeVoiceConfig — Main configuration type for the CompositeVoice SDK.
- ConversationHistoryConfig — Configuration for multi-turn conversation history.
- DeepgramFluxConfig — Configuration options for the DeepgramFlux provider.
- DeepgramFluxOptions — Deepgram Flux transcription options passed to the V2 WebSocket connection.
- DeepgramSTTConfig — Configuration options for the DeepgramSTT provider.
- DeepgramTranscriptionOptions — Deepgram-specific transcription options passed as query parameters on the WebSocket connection URL.
- DeepgramTTSConfig — Configuration for the DeepgramTTS provider.
- DeepgramTTSOptions — Deepgram-specific TTS synthesis options.
- EagerLLMConfig — Configuration for the eager LLM pipeline (speculative generation).
- ElevenLabsSTTConfig — Configuration options for the ElevenLabs STT provider.
- ElevenLabsTTSConfig — Configuration for the ElevenLabsTTS provider.
- EventListenerMap — Typed mapping from event type strings to their corresponding listener signatures.
- GeminiLLMConfig — Configuration for the Gemini LLM provider.
- GroqLLMConfig — Configuration for the Groq LLM provider.
- LLMGenerationOptions — Options for controlling a single LLM generation request.
- LLMMessage — A single message in an LLM conversation.
- LLMProvider — Large language model provider interface.
- LLMProviderConfig — Configuration for large language model providers.
- LLMToolCall — A tool invocation emitted by the LLM.
- LLMToolDefinition — Definition of a tool the LLM can invoke.
- LLMToolParameterSchema — Schema for a single tool parameter.
- LLMToolResult — Result sent back to the LLM after executing a tool.
- LoggingConfig — Configuration for SDK logging output.
- MicrophoneInputConfig — Configuration for MicrophoneInput.
- MistralLLMConfig — Configuration for the Mistral LLM provider.
- NativeSTTConfig — Configuration options for the NativeSTT provider.
- NativeTTSConfig — Configuration options for the NativeTTS provider.
- OpenAICompatibleLLMConfig — Configuration for any OpenAI-compatible LLM provider.
- OpenAILLMConfig — Configuration for the OpenAI LLM provider.
- OpenAITTSConfig — Configuration for the OpenAITTS provider.
- QueueOverflowEvent — Emitted when an AudioBufferQueue drops chunks due to overflow.
- QueueStats — Statistics snapshot from an AudioBufferQueue instance.
- QueueStatsEvent — Emitted when queue statistics are requested via getQueueStats().
- ReconnectionConfig — Configuration for automatic WebSocket reconnection with exponential backoff.
- ResolvedPipeline — A fully resolved 5-role pipeline with a provider assigned to each slot.
- STTProviderConfig — Configuration for speech-to-text providers.
- ToolAwareLLMProvider — LLM provider with optional tool use support.
- TranscriptionPreflightEvent — Emitted when a provider sends a preflight/eager-end-of-turn signal.
- TranscriptionResult — Result of a speech-to-text transcription.
- TranscriptionSpeechFinalEvent — Emitted when an utterance is fully complete.
- TTSProviderConfig — Configuration for text-to-speech providers.
- TurnTakingConfig — Configuration for turn-taking behavior between the user and the agent.
- WebLLMLLMConfig — Configuration for the WebLLM in-browser LLM provider.
- WebLLMLoadProgress — Progress information emitted during model download and WebGPU shader compilation.
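
The ReconnectionConfig entry above mentions exponential backoff. As a rough illustration of that idea only, the sketch below computes retry delays that grow by a constant factor and are capped at a maximum. The `BackoffSketchConfig` interface, its field names, and the `backoffDelayMs` helper are hypothetical and chosen for this example; they are not taken from the SDK, so consult the ReconnectionConfig page for the real fields.

```typescript
// Hypothetical shape for illustration only. These field names are assumptions,
// not the SDK's actual ReconnectionConfig definition.
interface BackoffSketchConfig {
  initialDelayMs: number; // delay before the first retry
  maxDelayMs: number;     // upper bound on any single delay
  multiplier: number;     // growth factor applied per attempt
  maxAttempts: number;    // stop retrying after this many attempts
}

// Returns the delay before retry number `attempt` (0-based),
// or null once the attempt budget is exhausted.
function backoffDelayMs(attempt: number, cfg: BackoffSketchConfig): number | null {
  if (attempt >= cfg.maxAttempts) return null;
  const raw = cfg.initialDelayMs * Math.pow(cfg.multiplier, attempt);
  return Math.min(raw, cfg.maxDelayMs);
}

const sketch: BackoffSketchConfig = {
  initialDelayMs: 500,
  maxDelayMs: 4000,
  multiplier: 2,
  maxAttempts: 5,
};

// Delays for attempts 0 through 5: 500, 1000, 2000, 4000, 4000, null
const delays = [0, 1, 2, 3, 4, 5].map((a) => backoffDelayMs(a, sketch));
```

Capping the delay keeps a long outage from pushing retries arbitrarily far apart, and returning `null` gives the caller a clear signal to stop reconnecting. Production implementations often add random jitter to the delay as well, which this sketch omits.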