BiRefNet
by ZhengPeng7
High-resolution foreground segmentation for object masks and visual evidence cleanup
ZhengPeng7/BiRefNetmixpeek://image_extractor@v1/zhengpeng7_birefnet_v1Overview
BiRefNet is the official checkpoint for Bilateral Reference for High-Resolution Dichotomous Image Segmentation. It targets foreground/background masks, salient object segmentation, and related cases where the useful evidence is an object region rather than the whole image.
On Mixpeek, BiRefNet can turn images or sampled video frames into mask metadata. Agents can use those masks to filter frames with clear foreground objects, crop objects before embedding, or remove distracting background before downstream OCR, detection, captioning, or similarity search.
Architecture
Image-segmentation model for high-resolution dichotomous segmentation. The Hugging Face card lists Transformers support through AutoModelForImageSegmentation, MIT licensing, and tags for background removal, mask generation, camouflaged object detection, and salient object detection.
Mixpeek SDK Integration
import { Mixpeek } from "mixpeek";const mx = new Mixpeek({ apiKey: "API_KEY" });await mx.collections.ingest({collection_id: "foreground-index",source: { url: "s3://product-media/images/" },feature_extractors: [{feature: "segmentation",model: "ZhengPeng7/BiRefNet",params: {output_masks: true,store_crops: true}}]});
Capabilities
- Foreground/background mask generation
- High-resolution dichotomous image segmentation
- Background removal and object isolation
- Useful pre-processing for embeddings, OCR, and VLM captioning
- MIT license
Use Cases on Mixpeek
Benchmarks
| Dataset | Metric | Score | Source |
|---|---|---|---|
| Hugging Face | Monthly downloads | 824K | HF model metadata, June 2026 |
| BiRefNet task coverage | Segmentation tags | DIS, camouflaged, salient object | BiRefNet model card |
Performance
Run before visual embeddings when foreground isolation improves retrieval quality
Specification
Research Paper
Bilateral Reference for High-Resolution Dichotomous Image Segmentation
arxiv.orgBuild a pipeline with BiRefNet
Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.
Open Studio