Fireworks RFT now available! Fine-tune open models that outperform frontier models. Try today

Model Library
/Allen Institute for AI/Molmo2-8B

Molmo2-8B

Ready
fireworks/molmo2-8b

    Molmo2 is a family of open vision-language models developed by the Allen Institute for AI (Ai2) that support image, video and multi-image understanding and grounding. Molmo 2 (8B) is Qwen 3-based and Ai2's best overall model for video grounding and QA.

    Molmo2-8B API Features

    On-demand Deployment

    Docs

    On-demand deployments give you dedicated GPUs for Molmo2-8B using Fireworks' reliable, high-performance system with no rate limits.

    Metadata

    State
    Ready
    Created on
    1/6/2026
    Kind
    Base model
    Provider
    Allen Institute for AI
    Hugging Face
    Molmo2-8B

    Specification

    Calibrated
    No
    Mixture-of-Experts
    No
    Parameters
    8B

    Supported Functionality

    Fine-tuning
    Not supported
    Serverless
    Not supported
    Context Length
    36.9k tokens
    Function Calling
    Not supported
    Embeddings
    Not supported
    Rerankers
    Not supported
    Support image input
    Supported