    Image Text To Text | transformers | mit

    InternVL2_5-4B

    by OpenGVLab

    40K downloads/month
    57 likes
    Identifier
    Model ID
    OpenGVLab/InternVL2_5-4B

    Tags

    transformers, tensorboard, safetensors, internvl_chat, feature-extraction, internvl, custom_code, image-text-to-text, conversational, multilingual, dataset:HuggingFaceFV/finevideo, arxiv:2312.14238, arxiv:2404.16821, arxiv:2410.16261, arxiv:2412.05271, base_model:OpenGVLab/InternViT-300M-448px-V2_5, base_model:merge:OpenGVLab/InternViT-300M-448px-V2_5, base_model:Qwen/Qwen2.5-3B-Instruct, base_model:merge:Qwen/Qwen2.5-3B-Instruct, license:mit, region:us
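
    Loading the model outside Mixpeek

The Model ID and the transformers and custom_code tags above suggest the checkpoint can be loaded directly with the Hugging Face transformers library. The following is a minimal sketch under that assumption, not a documented Mixpeek workflow. The helper name `load_internvl` is hypothetical, the bfloat16 dtype is an assumed choice, and `trust_remote_code=True` is used because the custom_code tag indicates the repository ships its own modeling code.

```python
MODEL_ID = "OpenGVLab/InternVL2_5-4B"  # Model ID as listed on this page


def load_internvl(model_id: str = MODEL_ID):
    """Hypothetical helper: return (tokenizer, model) for the given model ID.

    The first call downloads the checkpoint weights, so nothing heavy runs
    until this function is invoked.
    """
    # Imported lazily so this module can be imported without torch installed.
    import torch
    from transformers import AutoModel, AutoTokenizer

    # trust_remote_code=True executes modeling code from the model repository
    # (assumed necessary because of the custom_code tag); review it first.
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # assumed dtype; adjust for your hardware
        trust_remote_code=True,
    ).eval()
    return tokenizer, model
```

Calling `tokenizer, model = load_internvl()` performs the actual download and load. How images are preprocessed and passed to the model is defined by the repository's custom code and is not covered by this sketch.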

    Use InternVL2_5-4B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
