Mixpeek Logo
    document

    PDF
    Table Data
    Converter

    Extract tables from PDF documents and convert them into structured formats like JSON arrays, CSV, or Excel. Handles complex table layouts with merged cells, nested headers, multi-page tables, and borderless tables using AI-powered layout detection.

    Max file size: 200 MB
    Estimated: 2-10 sec per page
    1 input formats

    How It Works

    1

    Upload a PDF file or provide a URL to the Mixpeek API.

    2

    AI-powered layout analysis detects all table regions on each page.

    3

    Cell boundaries are identified using a combination of rule detection and machine learning.

    4

    Merged cells, nested headers, and multi-page continuation tables are resolved into clean row/column structures.

    5

    Tables are returned as structured arrays with headers, rows, and optional type inference per column.

    Code Examples

    from mixpeek import Mixpeek
    client = Mixpeek(api_key="YOUR_API_KEY")
    result = client.convert(
    source="https://example.com/financial-report.pdf",
    from_format="pdf",
    to_format="table-data",
    options={
    "stitch_multipage_tables": True,
    "table_output_format": "json",
    "include_headers": True,
    "pages": "5-15"
    }
    )
    for table in result.tables:
    print(f"Table on page {table.page}: {table.num_rows} rows x {table.num_cols} cols")
    for row in table.rows[:3]:
    print(f" {row}")

    Use Cases

    Extract financial tables from SEC filings and annual reports
    Pull pricing tables from vendor quotes and proposals
    Digitize scientific data tables from research papers
    Convert regulatory compliance tables into spreadsheet-ready formats

    Supported Input Formats

    PDF

    Quick Info

    Categorydocument
    Max File Size200 MB
    Est. Time2-10 sec per page

    Try This Conversion

    Get started with the Mixpeek API and convert your first file in minutes.

    Frequently Asked Questions

    Ready to convert pdf to table data?

    Start using the Mixpeek PDF to Table Data in minutes. Sign up for a free API key and follow the documentation to get started.