    Video to Images Converter

    Automatically detect scene changes and extract representative keyframes from any video. Each keyframe includes a timestamp, scene label, and optional caption generated by a vision model.

    Max file size: 5 GB
    Estimated processing time: 1-5 min per hour of video
    5 input formats

    How It Works

    1. Upload your video file or provide a URL.
    2. Scene-change detection identifies visual transition points.
    3. Representative frames are extracted at each transition.
    4. A vision model captions each keyframe and assigns a scene label.
    5. Keyframes are returned as images with metadata.
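
To make steps 2 and 3 concrete, here is a minimal sketch of one common way to find scene changes: compare each frame with the previous one and flag a new scene when the difference is large. This is an illustration only, not Mixpeek's actual detector. The function names, the toy frame representation (flat lists of grayscale pixel values), and the `threshold` parameter are all assumptions made for this example, and `threshold` is not claimed to correspond to the API's `sensitivity` option.

```python
from typing import List, Sequence

Frame = Sequence[int]  # hypothetical frame: flattened grayscale pixels, 0-255


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average per-pixel absolute difference, normalized to 0.0-1.0."""
    if len(a) != len(b) or not a:
        raise ValueError("frames must be non-empty and the same size")
    return sum(abs(x - y) for x, y in zip(a, b)) / (255.0 * len(a))


def detect_scene_starts(frames: List[Frame], threshold: float = 0.3) -> List[int]:
    """Return indices of frames that start a new scene.

    Frame 0 always starts the first scene. Any later frame whose
    difference from its predecessor exceeds `threshold` starts another.
    """
    if not frames:
        return []
    starts = [0]
    for i in range(1, len(frames)):
        if mean_abs_diff(frames[i - 1], frames[i]) > threshold:
            starts.append(i)
    return starts


# Toy "video": three dark frames, three bright frames, two mid-gray frames.
video = [[10] * 4] * 3 + [[240] * 4] * 3 + [[120] * 4] * 2
print(detect_scene_starts(video))  # [0, 3, 6]
```

In this sketch the frame at each returned index serves as the representative frame for its scene. A production detector would typically work on decoded video frames and use more robust features, such as color histograms, than raw pixel differences.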

    Code Examples

    from mixpeek import Mixpeek

    # Authenticate with your API key.
    client = Mixpeek(api_key="YOUR_API_KEY")

    # Convert a video into keyframes.
    result = client.convert(
        source="https://example.com/promo.mp4",
        from_format="video",
        to_format="keyframes",
        options={
            "sensitivity": 0.5,
            "max_frames": 50,
            "include_captions": True,
        },
    )

    # Print each keyframe's timestamp and caption.
    for frame in result.keyframes:
        print(frame.timestamp, frame.caption)
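
Once keyframes come back, a natural next step is to turn their metadata into something readable, such as a text storyboard. The sketch below does not call the Mixpeek API. It uses a hypothetical `Keyframe` dataclass as a stand-in for the returned objects, and it assumes that timestamps are expressed in seconds and that a caption may be missing. Both are assumptions, so check the actual response shape in the documentation.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Keyframe:
    """Hypothetical stand-in for one returned keyframe's metadata."""
    timestamp: float  # assumed to be seconds from the start of the video
    scene_label: str
    caption: Optional[str] = None


def format_timestamp(seconds: float) -> str:
    """Render seconds as HH:MM:SS, truncating fractional seconds."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def storyboard_lines(frames: List[Keyframe]) -> List[str]:
    """Build one text line per keyframe, ordered by timestamp."""
    lines = []
    for frame in sorted(frames, key=lambda f: f.timestamp):
        caption = frame.caption or "(no caption)"
        lines.append(
            f"{format_timestamp(frame.timestamp)} [{frame.scene_label}] {caption}"
        )
    return lines


frames = [
    Keyframe(75.4, "product close-up", "A phone on a white table"),
    Keyframe(0.0, "intro"),
    Keyframe(3725.0, "outro", "Logo on a dark background"),
]
for line in storyboard_lines(frames):
    print(line)
# 00:00:00 [intro] (no caption)
# 00:01:15 [product close-up] A phone on a white table
# 01:02:05 [outro] Logo on a dark background
```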

    Use Cases

    Build visual indexes for video libraries
    Generate storyboards for film and advertising review
    Create thumbnail galleries for e-learning platforms
    Power visual search across video catalogs

    Supported Input Formats

    MP4
    MOV
    AVI
    MKV
    WebM
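
It can save a round trip to check a file's extension against the list above before submitting it. The helper below is a client-side sketch, not part of the Mixpeek SDK: the function name is hypothetical, and it assumes the service identifies these formats by their usual file extensions, which the page does not state.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions assumed to correspond to the five listed formats.
SUPPORTED_EXTENSIONS = {".mp4", ".mov", ".avi", ".mkv", ".webm"}


def is_supported_source(source: str) -> bool:
    """True if a local path or URL ends in one of the listed extensions.

    For URLs, only the path component is inspected, so query strings
    such as "?token=abc" do not interfere with the check.
    """
    path = urlparse(source).path or source
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS


print(is_supported_source("https://example.com/promo.mp4?token=abc"))  # True
print(is_supported_source("clip.MOV"))                                 # True
print(is_supported_source("notes.txt"))                                # False
```

An extension check is only a quick filter. A file with the right extension can still contain an unsupported codec, so the API's own response remains the final word.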

    Quick Info

    Category: media
    Max File Size: 5 GB
    Est. Time: 1-5 min per hour of video
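
The figures above translate into simple arithmetic for planning a batch job. This sketch assumes that "5 GB" means decimal gigabytes (5 × 10^9 bytes) and that processing time scales linearly with video length. Neither is confirmed by the page, and the function names are hypothetical.

```python
from typing import Tuple

MAX_FILE_SIZE_BYTES = 5 * 1000**3  # assumption: "5 GB" is decimal gigabytes
MIN_MINUTES_PER_HOUR = 1.0         # lower bound of the stated estimate
MAX_MINUTES_PER_HOUR = 5.0         # upper bound of the stated estimate


def fits_size_limit(size_bytes: int) -> bool:
    """True if the file size is within the stated 5 GB limit."""
    return 0 <= size_bytes <= MAX_FILE_SIZE_BYTES


def estimate_processing_minutes(video_minutes: float) -> Tuple[float, float]:
    """Return (low, high) processing-time estimates in minutes."""
    if video_minutes < 0:
        raise ValueError("video_minutes must be non-negative")
    hours = video_minutes / 60.0
    return (hours * MIN_MINUTES_PER_HOUR, hours * MAX_MINUTES_PER_HOUR)


low, high = estimate_processing_minutes(90)
print(f"A 90-minute video: roughly {low} to {high} minutes")
# A 90-minute video: roughly 1.5 to 7.5 minutes
print(fits_size_limit(6 * 1000**3))  # False
```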

    Try This Conversion

    Get started with the Mixpeek API and convert your first file in minutes.

    Ready to convert video to images?

    Start using the Mixpeek Video to Keyframes converter in minutes. Sign up for a free API key and follow the documentation to get started.