NEWAgents can now see video via MCP.Try it now →
    Models/Image To Text/phxember/Uni-MuMER-Qwen3-VL-2B
    Image To Texttransformersapache-2.0

    Uni-MuMER-Qwen3-VL-2B

    by phxember

    Identifier
    Model ID
    phxember/Uni-MuMER-Qwen3-VL-2B

    Tags

    transformerssafetensorsqwen3_vlimage-text-to-textuni-mumerhmermath-ocrhandwritten-mathlatexqwen3-vlvision-languageimage-to-textendataset:phxember/Uni-MuMER-Dataarxiv:2505.23566base_model:Qwen/Qwen3-VL-2B-Instructbase_model:finetune:Qwen/Qwen3-VL-2B-Instructlicense:apache-2.0endpoints_compatibleregion:us

    Use Uni-MuMER-Qwen3-VL-2B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder