NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/microsoft/Phi-3.5-vision-instruct
    Image Text To Texttransformersmit

    Phi-3.5-vision-instruct

    by microsoft

    1.5Mdl/month
    732likes
    Identifier
    Model ID
    microsoft/Phi-3.5-vision-instruct

    Tags

    transformerssafetensorsphi3_vtext-generationnlpcodevisionimage-text-to-textconversationalcustom_codemultilingualarxiv:2404.14219license:mitregion:us

    Use Phi-3.5-vision-instruct on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder