NEWAgents can now see video via MCP.Try it now →
    Models/Image To Text/nyu-visionx/Cambrian-S-3B
    Image To Texttransformersapache-2.0

    Cambrian-S-3B

    by nyu-visionx

    Identifier
    Model ID
    nyu-visionx/Cambrian-S-3B

    Tags

    transformerssafetensorscambrian_qwentext-generationmultimodalvideo-understandingspatial-reasoningvision-languageimage-to-textendataset:nyu-visionx/VSI-590Karxiv:2511.04670base_model:Qwen/Qwen2.5-3B-Instructbase_model:finetune:Qwen/Qwen2.5-3B-Instructlicense:apache-2.0endpoints_compatibleregion:us

    Use Cambrian-S-3B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder