NEWAgents can now see video via MCP.Try it now →
    Models/Automatic Speech Recognition/espnet/owsm_ctc_v4_1B

    owsm_ctc_v4_1B

    by espnet

    Identifier
    Model ID
    espnet/owsm_ctc_v4_1B

    Tags

    espnetaudioautomatic-speech-recognitionspeech-translationlanguage-identificationmultilingualdataset:espnet/yodas_owsmv4arxiv:2406.09282arxiv:2401.16658arxiv:2309.13876license:cc-by-4.0eval-resultsregion:us

    Use owsm_ctc_v4_1B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder