NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/allenai/Molmo2-O-7B
    Image Text To Texttransformersapache-2.0

    Molmo2-O-7B

    by allenai

    71Kdl/month
    21likes
    Identifier
    Model ID
    allenai/Molmo2-O-7B

    Tags

    transformerssafetensorsmolmo2image-text-to-textmultimodalolmomolmoconversationalcustom_codeendataset:allenai/Molmo2-Capdataset:allenai/Molmo2-VideoCapQAdataset:allenai/Molmo2-VideoSubtitleQAdataset:allenai/Molmo2-AskModelAnythingdataset:allenai/Molmo2-VideoPointdataset:allenai/Molmo2-VideoTrackdataset:allenai/Molmo2-MultiImageQAdataset:allenai/Molmo2-SynMultiImageQAdataset:allenai/Molmo2-MultiImagePointbase_model:allenai/Olmo-3-7B-Instructbase_model:finetune:allenai/Olmo-3-7B-Instructlicense:apache-2.0region:us

    Use Molmo2-O-7B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder