
    Page Segmentation

    Divide PDF pages into logical sections and regions

    Note: This playground provides simulated output to showcase functionality. No input data is processed or stored on our servers. Use this demo to explore the feature extractor's capabilities before integrating it into your application.

    Input

    Enter a URL to a PDF file, or upload a PDF file

    Types of regions to detect. Default: title,paragraph,image,table,header,footer

    Minimum confidence threshold for region detection. Default: 0.75
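
As a rough illustration of how the two inputs above could interact, the sketch below keeps only candidate regions whose type is in the requested list and whose score meets the threshold. This is a minimal sketch, not the extractor's actual implementation: the function names and the per-region `confidence` field are assumptions made for the example (the sample output on this page does not include a confidence value).

```python
# Defaults copied from the input descriptions above.
DEFAULT_REGION_TYPES = "title,paragraph,image,table,header,footer"
DEFAULT_MIN_CONFIDENCE = 0.75


def parse_region_types(value=DEFAULT_REGION_TYPES):
    """Split a comma-separated type list, ignoring blanks and stray spaces."""
    return [t.strip() for t in value.split(",") if t.strip()]


def filter_regions(candidates,
                   region_types=DEFAULT_REGION_TYPES,
                   min_confidence=DEFAULT_MIN_CONFIDENCE):
    """Keep candidates with an allowed type and a high enough score.

    Each candidate is a dict with a "type" key and a hypothetical
    "confidence" key (assumed for this sketch only).
    """
    allowed = set(parse_region_types(region_types))
    return [
        region for region in candidates
        if region["type"] in allowed and region["confidence"] >= min_confidence
    ]


candidates = [
    {"type": "title", "confidence": 0.90},
    {"type": "table", "confidence": 0.60},   # below the default threshold
    {"type": "figure", "confidence": 0.99},  # not in the default type list
]
kept = filter_regions(candidates)
```

With the defaults, only the `title` candidate survives; lowering `min_confidence` to 0.5 would also keep the `table` candidate.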

    Output

    {
      "page_regions": [
        {
          "page": 1,
          "regions": [
            {
              "type": "title",
              "bbox": [150, 50, 500, 100],
              "text": "Annual Report 2023"
            },
            {
              "type": "image",
              "bbox": [200, 120, 450, 250],
              "description": "company logo"
            },
            {
              "type": "paragraph",
              "bbox": [100, 280, 550, 450],
              "text": "This annual report presents..."
            }
          ]
        }
      ],
      "layout_analysis": {
        "columns": 1,
        "reading_order": ["title", "image", "paragraph"],
        "document_structure": "standard_report"
      }
    }
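
A response shaped like the sample above can be walked with the standard `json` module. The sketch below flattens it into one row per region. It assumes each `bbox` is `[x0, y0, x1, y1]` (top-left and bottom-right corners), which is consistent with the sample values but is not stated on this page; the helper names are illustrative.

```python
import json

# The simulated output shown above, embedded as a string for the example.
SAMPLE_OUTPUT = """
{
  "page_regions": [
    {
      "page": 1,
      "regions": [
        {"type": "title", "bbox": [150, 50, 500, 100],
         "text": "Annual Report 2023"},
        {"type": "image", "bbox": [200, 120, 450, 250],
         "description": "company logo"},
        {"type": "paragraph", "bbox": [100, 280, 550, 450],
         "text": "This annual report presents..."}
      ]
    }
  ],
  "layout_analysis": {
    "columns": 1,
    "reading_order": ["title", "image", "paragraph"],
    "document_structure": "standard_report"
  }
}
"""


def bbox_size(bbox):
    """Return (width, height), assuming bbox is [x0, y0, x1, y1]."""
    x0, y0, x1, y1 = bbox
    return x1 - x0, y1 - y0


def summarize(output):
    """Flatten the response into (page, type, width, height, label) rows."""
    rows = []
    for page in output["page_regions"]:
        for region in page["regions"]:
            width, height = bbox_size(region["bbox"])
            # Text regions carry "text"; image regions carry "description".
            label = region.get("text") or region.get("description", "")
            rows.append((page["page"], region["type"], width, height, label))
    return rows


result = json.loads(SAMPLE_OUTPUT)
rows = summarize(result)
for row in rows:
    print(row)
```

Reading the label from either `text` or `description` keeps the loop uniform across region types, since the sample shows image regions using a different key from text regions.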