NEWAgents can now see video via MCP.Try it now →
    Models/Visual Question Answering/nectec/Pathumma-llm-vision-1.0.0

    Pathumma-llm-vision-1.0.0

    by nectec

    Identifier
    Model ID
    nectec/Pathumma-llm-vision-1.0.0

    Tags

    safetensorsidefics3visual-question-answeringthenarxiv:2408.12637base_model:HuggingFaceM4/Idefics3-8B-Llama3base_model:finetune:HuggingFaceM4/Idefics3-8B-Llama3region:us

    Use Pathumma-llm-vision-1.0.0 on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder