Fireworks RFT now available! Fine-tune open models that outperform frontier models. Try today

Model Library
/OpenAI/Whisper V3 Turbo
OpenAi Logo MArk

Whisper V3 Turbo

Ready
fireworks/whisper-v3-turbo

    Whisper large-v3-turbo is a finetuned version of a pruned Whisper large-v3. In other words, it's the exact same model, except that the number of decoding layers have reduced from 32 to 4. As a result, the model is way faster, at the expense of a minor quality degradation.

    Whisper V3 Turbo API Features

    Serverless

    Docs

    Immediately run model on pre-configured GPUs and pay-per-token

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Whisper V3 Turbo using Fireworks' reliable, high-performance system with no rate limits.

    0

    Whisper V3 Turbo FAQs

    What is Whisper V3 Turbo and who developed it?

    Whisper V3 Turbo is a fine-tuned variant of OpenAI's Whisper large-v3 model in which the decoder layers were reduced from 32 to 4, delivering much faster inference with only minor quality loss. The model is served on Fireworks AI and the underlying model was created by OpenAI.

    What applications and use-cases does Whisper V3 Turbo excel at?

    Whisper V3 Turbo is optimized for:

    • Automatic speech recognition (ASR) across 99 languages
    • Zero-shot speech-to-English translation
    • Batch or near-real-time transcription pipelines where speed is critical
    What is the maximum context length for Whisper V3 Turbo?

    Whisper models process up to 30 seconds of audio per forward pass (known as the model's "receptive field").

    What is the usable context window?

    The usable window is effectively the 30-second receptive field. Longer recordings must be chunked or streamed in sequential windows.

    What are known failure modes of Whisper V3 Turbo?

    Known limitations include:

    • Hallucinating words not present in the audio
    • Higher error rates on low-resource languages
    • Difficulty with certain accents and demographic groups
    • Potential for repetitive text generation due to seq-to-seq design
    How many parameters does Whisper V3 Turbo have?

    Whisper V3 Turbo has approximately 809 million parameters.

    What license governs commercial use of Whisper V3 Turbo?

    Whisper V3 Turbo was released under the MIT License.

    Metadata

    State
    Ready
    Created on
    N/A
    Kind
    Unknown
    Provider
    OpenAI

    Specification

    Calibrated
    No
    Mixture-of-Experts
    No
    Parameters
    N/A

    Supported Functionality

    Fine-tuning
    Not supported
    Serverless
    Supported
    Serverless LoRA
    Not supported
    Context Length
    N/A
    Function Calling
    Not supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Not supported