Streaming Transcription
Convert audio to text in real time over WebSocket connections. Well suited to voice agents and live applications.
Available Models:
- fireworks-asr-large: Cost-efficient model for real-time transcription over WebSockets
- fireworks-asr-v2: Next-generation, ultra-low-latency audio streaming model for real-time transcription over WebSockets
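As a rough illustration of the client side of streaming, the sketch below splits raw PCM audio into short chunks and hands each one to a `send` coroutine, which in a real client would be the WebSocket's send method. The endpoint URL, the `model` and `language` query parameters, the 16 kHz 16-bit mono format, and the 50 ms chunk size are all assumptions made for illustration; check the streaming API reference for the real values.

```python
import asyncio
from typing import Awaitable, Callable, Iterator
from urllib.parse import urlencode

# Hypothetical placeholder: the real streaming endpoint is not given on this page.
STREAMING_URL = "wss://example.invalid/v1/audio/transcriptions/streaming"


def build_streaming_url(base_url: str, model: str, language: str = "en") -> str:
    """Append assumed query parameters (model, language) to the endpoint URL."""
    return f"{base_url}?{urlencode({'model': model, 'language': language})}"


def pcm_chunks(
    pcm: bytes, sample_rate: int = 16000, chunk_ms: int = 50, sample_width: int = 2
) -> Iterator[bytes]:
    """Yield fixed-duration slices of raw mono PCM; the last slice may be shorter."""
    chunk_bytes = sample_rate * chunk_ms // 1000 * sample_width
    for start in range(0, len(pcm), chunk_bytes):
        yield pcm[start:start + chunk_bytes]


async def stream_audio(
    send: Callable[[bytes], Awaitable[None]], pcm: bytes, pace_s: float = 0.0
) -> int:
    """Send every chunk through `send` and return how many chunks were sent.

    Set pace_s to the chunk duration (e.g. 0.05) to mimic a live microphone.
    """
    sent = 0
    for chunk in pcm_chunks(pcm):
        await send(chunk)
        sent += 1
        await asyncio.sleep(pace_s)
    return sent
```

With a WebSocket client library, `send` would typically be the connection's own send coroutine, and a second task would read transcript messages from the same connection while audio is still being sent.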
Pre-recorded Transcription
Convert audio files to text. Supports files up to 1GB in formats like MP3, FLAC, and WAV. Transcribe multiple hours of audio in minutes.
For a working example of pre-recorded transcription, see the Python notebook.
Available Models:
- whisper-v3: Highest accuracy
  - model=whisper-v3
  - base_url=https://audio-prod.api.fireworks.ai
- whisper-v3-turbo: Faster processing
  - model=whisper-v3-turbo
  - base_url=https://audio-turbo.api.fireworks.ai
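Each pre-recorded model is paired with its own base URL, so a client has to pick both together. The sketch below shows one way to keep that pairing in a single place and post a file for transcription. The `/v1/audio/transcriptions` path, the Bearer-token header, the `FIREWORKS_API_KEY` environment variable, and the `text` response field are assumptions based on the common OpenAI-style audio API shape, not details confirmed on this page; `requests` is a third-party dependency.

```python
import os

# Model-to-base-URL pairing taken from the list above.
MODEL_BASE_URLS = {
    "whisper-v3": "https://audio-prod.api.fireworks.ai",
    "whisper-v3-turbo": "https://audio-turbo.api.fireworks.ai",
}


def transcription_endpoint(model: str) -> str:
    """Return the full endpoint URL for a model (the path is an assumption)."""
    if model not in MODEL_BASE_URLS:
        raise ValueError(f"unknown model: {model!r}")
    return f"{MODEL_BASE_URLS[model]}/v1/audio/transcriptions"


def transcribe(path: str, model: str = "whisper-v3", api_key: str = "") -> str:
    """Upload an audio file and return the transcript text (assumed response shape)."""
    import requests  # third-party; imported here so the helpers above work without it

    api_key = api_key or os.environ["FIREWORKS_API_KEY"]  # assumed variable name
    with open(path, "rb") as audio:
        response = requests.post(
            transcription_endpoint(model),
            headers={"Authorization": f"Bearer {api_key}"},
            files={"file": audio},
            data={"model": model},
            timeout=600,
        )
    response.raise_for_status()
    return response.json()["text"]
```

Keeping the lookup in one dictionary avoids the easy mistake of sending `whisper-v3-turbo` to the `whisper-v3` base URL or the reverse.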
Pre-recorded Translation
Translate audio from any of our supported languages to English. Supports files up to 1GB in formats like MP3, FLAC, and WAV.
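Because uploads are capped at 1GB, a quick local check before sending a file can save a failed request. The sketch below is a hypothetical pre-flight helper: it assumes 1GB means 10^9 bytes and treats only the three formats named above as known, although the wording "formats like" suggests other formats may also be accepted.

```python
from pathlib import Path
from typing import List

MAX_UPLOAD_BYTES = 1_000_000_000  # assumption: 1GB interpreted as 10**9 bytes
KNOWN_EXTENSIONS = {".mp3", ".flac", ".wav"}  # only the formats named in the text


def preflight(filename: str, size_bytes: int) -> List[str]:
    """Return a list of human-readable problems; an empty list means no issue found."""
    problems = []
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append(f"{filename} is {size_bytes} bytes, above the 1GB limit")
    if Path(filename).suffix.lower() not in KNOWN_EXTENSIONS:
        problems.append(f"{filename} has an extension not listed as supported")
    return problems
```

In practice the size would come from `os.path.getsize(path)`, and the extension warning is advisory rather than a hard failure.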
Supported Languages
We support 95+ languages including English, Spanish, French, German, Chinese, Japanese, Russian, Portuguese, and many more. See the complete language list.
Common Use Cases
- Call Center / Customer Service: Transcribe or translate customer calls
- Note Taking: Transcribe audio for automated note taking
Next Steps
- Explore advanced features like speaker diarization and custom prompts
- Contact us at [email protected] for dedicated endpoints and enterprise features