NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/google/paligemma2-3b-ft-docci-448
    Image Text To Texttransformersgemma

    paligemma2-3b-ft-docci-448

    by google

    43Kdl/month
    13likes
    Identifier
    Model ID
    google/paligemma2-3b-ft-docci-448

    Tags

    transformerssafetensorspaligemmaimage-text-to-textarxiv:2407.07726arxiv:2408.00118arxiv:2310.09199arxiv:2303.15343arxiv:1706.03762arxiv:2010.11929arxiv:2412.03555arxiv:2209.06794arxiv:2209.04372arxiv:2103.01913arxiv:1908.04913arxiv:1906.02467arxiv:2203.10244arxiv:2205.12522arxiv:2104.12756arxiv:1608.00272arxiv:1511.02283arxiv:1905.13648arxiv:2110.11624arxiv:2108.03353arxiv:1810.12440arxiv:1904.03493arxiv:2010.04295arxiv:1511.09207license:gemmatext-generation-inference

    Use paligemma2-3b-ft-docci-448 on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder