NEWAgents can now see video via MCP.Try it now →
    Models/Image To Text/naver-clova-ix/donut-base-finetuned-rvlcdip
    Image To Texttransformersmit

    donut-base-finetuned-rvlcdip

    by naver-clova-ix

    Identifier
    Model ID
    naver-clova-ix/donut-base-finetuned-rvlcdip

    Tags

    transformerspytorchvision-encoder-decoderimage-text-to-textdonutimage-to-textvisionarxiv:2111.15664license:mitendpoints_compatibleregion:us

    Use donut-base-finetuned-rvlcdip on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder