367Kdl/month
1,593likes
Identifier
Model ID
microsoft/Phi-4-multimodal-instructTags
transformerssafetensorsphi4mmtext-generationnlpcodeaudioautomatic-speech-recognitionspeech-summarizationspeech-translationvisual-question-answeringphi-4-multimodalphiphi-4-minicustom_codemultilingualarzhcsdanlenfifrdehehuitjako
Use Phi-4-multimodal-instruct on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationmicrosoft
TaskAutomatic Speech Recognition
Librarytransformers
Licensemit
Downloads/mo367K
Likes1,593
View on HuggingFace
See model card, files, and community discussion
Related Automatic Speech Recognition Models
pyannote/speaker-diarization-3.1
10.2M
argmaxinc/whisperkit-coreml
8.1M
openai/whisper-large-v3-turbo
7.0M
openai/whisper-large-v3
4.9M
jonatasgrosman/wav2vec2-large-xlsr-53-russian
4.9M
jonatasgrosman/wav2vec2-large-xlsr-53-portuguese
3.8M
MahmoudAshraf/mms-300m-1130-forced-aligner
3.7M
pyannote/voice-activity-detection
2.7M