Cutsio
A collaborative video editing platform for professional post-production teams working with raw cinema camera footage across commercials, film, and episodic television.
-99%
Footage Search Time
Fully Automated
Raw Format Support
-86%
Editor Time on Search
Full Lineage
Pipeline Audit Coverage
The Challenge
Professional video editors working with RED R3D and ARRI RAW footage had no way to search their media libraries visually. Each project involved thousands of hours of raw clips spread across cloud storage, and finding the right shot meant scrubbing through footage manually or relying on sparse, hand-typed notes. Editors spent 30-40% of their time just looking for clips instead of cutting. Converting raw cinema camera formats for preview added another bottleneck, since tools like DaVinci Resolve had to run locally on each editor's machine before footage could even be reviewed.
Pipeline Architecture
End-to-end flow from raw footage ingestion to visual search, with full audit trail at every stage.
Mux Selective Sync
Filters assets by naming convention, MIME type, and file size. Controls which customer-tier footage enters the pipeline.
RAW Video Conversion
Custom plugin (v3.0) decodes RED R3D via REDline and ARRI RAW via ART CMD. Transcodes to H.264 server-side, no DaVinci needed.
Multimodal Decomposition
Extracts scene-level embeddings: composition, subjects, motion, color palette, and on-screen text from converted footage.
Visual Search
Editors search by reference frame, natural language, or both. Results ranked by scene similarity with sub-second latency.
Audit Trail
Every ingestion, conversion, and search is logged. Producers see what was processed, when, and how, with full lineage.
The Solution
Mixpeek's pipeline connects directly to Cutsio's Mux video infrastructure through a selective sync that filters which assets get indexed by naming convention, file type, and size. A custom video conversion plugin with built-in REDline and ARRI Reference Tool CMD decodes raw cinema formats server-side without requiring DaVinci Resolve. Converted footage flows into a multimodal decomposition collection that extracts scene-level embeddings, capturing composition, subjects, motion, color palette, and on-screen text. A visual search retriever lets editors find shots by uploading a reference frame, describing a scene in natural language, or combining both. Every step is tracked through Mixpeek's audit trail, giving producers full visibility into what was processed, when, and how.
Implementation
Cutsio's pipeline runs as two chained collections: the first handles raw format detection and transcoding (R3D, ARRI RAW, and standard codecs), and the second performs multimodal feature extraction on the converted output. A Mux selective sync feeds the pipeline with file filters that control which assets are indexed per customer tier. The custom conversion plugin was developed as a v3 iteration that eliminated the external DaVinci dependency by leveraging RED and ARRI's native CLI tools pre-installed in the Mixpeek engine base image. End-to-end, a raw R3D file uploaded to Mux becomes visually searchable within minutes.
Results
Before and after Mixpeek
-99%
Footage Search Time
Fully Automated
Raw Format Support
-86%
Editor Time on Search
Full Lineage
Pipeline Audit Coverage
Customer testimonial
"Our editors used to lose half their day hunting for the right take. Now they describe what they need and the exact frame comes back in seconds. The fact that it handles RED and ARRI raw natively, without DaVinci in the loop, was the unlock."
Get Similar Results
See how Mixpeek can deliver measurable impact for your Media & Entertainment organization. Book a personalized demo to discuss your specific challenges.