NEWAgents can now see video via MCP.Try it now →
    Models/Document Question Answering/p786/donut-base-finetuned-docvqa

    donut-base-finetuned-docvqa

    by p786

    Identifier
    Model ID
    p786/donut-base-finetuned-docvqa

    Tags

    pytorchvision-encoder-decoderdonutimage-to-textvisiondocument-question-answeringarxiv:2111.15664license:mitregion:us

    Use donut-base-finetuned-docvqa on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder