NEWAgents can now see video via MCP.Try it now →
    Models/Automatic Speech Recognition/microsoft/Phi-4-multimodal-instruct

    Phi-4-multimodal-instruct

    by microsoft

    367Kdl/month
    1,593likes
    Identifier
    Model ID
    microsoft/Phi-4-multimodal-instruct

    Tags

    transformerssafetensorsphi4mmtext-generationnlpcodeaudioautomatic-speech-recognitionspeech-summarizationspeech-translationvisual-question-answeringphi-4-multimodalphiphi-4-minicustom_codemultilingualarzhcsdanlenfifrdehehuitjako

    Use Phi-4-multimodal-instruct on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder