BiRefNet
by ZhengPeng7
High-resolution foreground segmentation for object masks and visual evidence cleanup
ZhengPeng7/BiRefNet
mixpeek://image_extractor@v1/zhengpeng7_birefnet_v1
Overview
BiRefNet is the official checkpoint for Bilateral Reference for High-Resolution Dichotomous Image Segmentation. It targets foreground/background masks, salient object segmentation, and related cases where the useful evidence is an object region rather than the whole image.
On Mixpeek, BiRefNet can turn images or sampled video frames into mask metadata. Agents can use those masks to filter frames with clear foreground objects, crop objects before embedding, or remove distracting background before downstream OCR, detection, captioning, or similarity search.
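The frame-filtering and cropping steps described above can be sketched roughly as follows. This is a minimal illustration, not Mixpeek's or BiRefNet's actual output format: the `Mask` shape (one byte per pixel, row-major, 0 for background and 255 for foreground), the `Box` type, the helper names, and the threshold of 128 are all assumptions made for the example.

```typescript
// Hypothetical mask layout (an assumption, not a documented Mixpeek schema):
// one byte per pixel, row-major, 0 = background, 255 = foreground.
interface Mask {
  width: number;
  height: number;
  data: Uint8Array;
}

interface Box {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Fraction of pixels whose mask value is at or above the threshold.
// A frame with a very low ratio has no clear foreground object.
function foregroundRatio(mask: Mask, threshold = 128): number {
  let count = 0;
  for (let i = 0; i < mask.data.length; i++) {
    if ((mask.data[i] ?? 0) >= threshold) count++;
  }
  return mask.data.length === 0 ? 0 : count / mask.data.length;
}

// Tight bounding box around all foreground pixels, or null if there are none.
// The box can be used to crop the object before embedding.
function foregroundBox(mask: Mask, threshold = 128): Box | null {
  let minX = mask.width;
  let minY = mask.height;
  let maxX = -1;
  let maxY = -1;
  for (let y = 0; y < mask.height; y++) {
    for (let x = 0; x < mask.width; x++) {
      if ((mask.data[y * mask.width + x] ?? 0) >= threshold) {
        if (x < minX) minX = x;
        if (x > maxX) maxX = x;
        if (y < minY) minY = y;
        if (y > maxY) maxY = y;
      }
    }
  }
  if (maxX < 0) return null;
  return { x: minX, y: minY, width: maxX - minX + 1, height: maxY - minY + 1 };
}

// Demo: a 4x4 mask with a 2x2 foreground block in the middle.
const demoMask: Mask = {
  width: 4,
  height: 4,
  data: new Uint8Array([
    0, 0, 0, 0,
    0, 255, 255, 0,
    0, 255, 255, 0,
    0, 0, 0, 0,
  ]),
};
const demoRatio = foregroundRatio(demoMask);
const demoBox = foregroundBox(demoMask);
```

A pipeline might keep only frames whose ratio falls inside a chosen range and pass the bounding box to whatever cropping step precedes embedding. Suitable cutoffs depend on the content and would need tuning.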
Architecture
BiRefNet is an image-segmentation model for high-resolution dichotomous segmentation, meaning it separates each image into foreground and background. The Hugging Face card lists Transformers support through AutoModelForImageSegmentation, MIT licensing, and tags for background removal, mask generation, camouflaged object detection, and salient object detection.
Mixpeek SDK Integration
import { Mixpeek } from "mixpeek";
const mx = new Mixpeek({ apiKey: "API_KEY" });
// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
  namespace_id: "my-namespace",
  collection_name: "my-collection",
  source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
  feature_extractor: {
    feature_extractor_name: "segmentation",
    version: "v1",
    parameters: { model_id: "ZhengPeng7/BiRefNet" },
  },
});
Capabilities
- Foreground/background mask generation
- High-resolution dichotomous image segmentation
- Background removal and object isolation
- Useful pre-processing for embeddings, OCR, and VLM captioning
- MIT license
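Background removal from a foreground mask can be sketched as alpha compositing: each pixel's alpha channel is scaled by the mask value, so background pixels become transparent. The sketch below assumes an RGBA buffer with four bytes per pixel and a single-channel mask of the same dimensions with values from 0 to 255. The function name `applyMaskAsAlpha` and both buffer layouts are illustrative assumptions, not part of BiRefNet or the Mixpeek SDK.

```typescript
// Assumptions for this sketch: `rgba` holds 4 bytes per pixel (R, G, B, A),
// and `mask` holds 1 byte per pixel for the same image (0 = background,
// 255 = foreground, intermediate values = soft edges).
function applyMaskAsAlpha(
  rgba: Uint8ClampedArray,
  mask: Uint8Array,
): Uint8ClampedArray {
  if (rgba.length !== mask.length * 4) {
    throw new Error("Image and mask dimensions do not match");
  }
  // Copy the input so the caller's buffer is left untouched.
  const out = new Uint8ClampedArray(rgba);
  for (let p = 0; p < mask.length; p++) {
    const alphaIndex = p * 4 + 3;
    const originalAlpha = rgba[alphaIndex] ?? 0;
    const maskValue = mask[p] ?? 0;
    // Scale the existing alpha by the mask value, normalized to 0..1.
    out[alphaIndex] = Math.round((originalAlpha * maskValue) / 255);
  }
  return out;
}

// Demo: two opaque pixels; the first is foreground, the second background.
const demoPixels = new Uint8ClampedArray([10, 20, 30, 255, 40, 50, 60, 255]);
const demoAlphaMask = new Uint8Array([255, 0]);
const demoCutout = applyMaskAsAlpha(demoPixels, demoAlphaMask);
```

Keeping the color channels intact and changing only alpha means the cutout can still be composited over any background later, or flattened onto a neutral color before OCR or captioning.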
Benchmarks
| Dataset | Metric | Score | Source |
|---|---|---|---|
| Hugging Face | Monthly downloads | 824K | HF model metadata, June 2026 |
| BiRefNet task coverage | Segmentation tags | DIS, camouflaged, salient object | BiRefNet model card |
Performance
Run BiRefNet before visual embedding extraction when foreground isolation improves retrieval quality.
Research Paper
Bilateral Reference for High-Resolution Dichotomous Image Segmentation
arxiv.org
Build a pipeline with BiRefNet
Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.