文档
将 Sayd 集成到你的应用所需的一切。
Talk API
Push-to-talk voice input with AI-powered transcript cleaning. WebSocket streaming + LLM processing.
Listen API
Real-time speech-to-text via WebSocket. Raw transcription without AI cleaning — you control the pipeline.
Transcribe API
Upload audio files for async transcription. Supports WAV, MP3, and more. Results via polling.
VAD API
Voice Activity Detection — detect speech segments or check if audio contains speech.
Getting Started
The fastest way to get started:
- Create a free account and get your API key
- Follow the Quick Start guide
- Explore the Talk API documentation
Why Sayd?
| Feature | Sayd | Traditional APIs |
|---|---|---|
| Latency | < 200ms first byte | 500ms+ |
| Pricing | Token-based, from $0 | Per-minute, minimums |
| Agent Integration | Native support | Manual wiring |