    PDF to JSON Converter

    Convert PDF documents into clean, structured JSON output. Extracts text, tables, form fields, metadata, and document structure into a machine-readable JSON format suitable for API ingestion, database storage, and programmatic processing.

    Max file size: 200 MB
    Estimated: 2-15 sec per page
    1 input format
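The limits above can be turned into a quick pre-flight check before uploading. This is only a sketch: `within_size_limit` and `estimate_seconds` are hypothetical helpers, not part of the Mixpeek SDK, and the code assumes "MB" means 10**6 bytes, which the page does not specify.

```python
# Hypothetical pre-flight helpers; not part of the Mixpeek SDK.
MAX_FILE_SIZE_BYTES = 200 * 1000 * 1000  # assumption: 1 MB = 10**6 bytes
SECONDS_PER_PAGE = (2, 15)  # estimated range quoted on this page


def within_size_limit(size_bytes: int) -> bool:
    """Return True if a file of this size fits under the 200 MB limit."""
    return 0 <= size_bytes <= MAX_FILE_SIZE_BYTES


def estimate_seconds(page_count: int) -> tuple[int, int]:
    """Return the (low, high) processing-time estimate in seconds."""
    if page_count < 0:
        raise ValueError("page_count must be non-negative")
    low, high = SECONDS_PER_PAGE
    return page_count * low, page_count * high
```

For a 20-page document, `estimate_seconds(20)` gives a range of 40 to 300 seconds under the quoted per-page estimate.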

    How It Works

    1. Upload a PDF file or provide a URL to the Mixpeek API.
    2. The document is classified as digital-native or scanned, with OCR applied as needed.
    3. Layout analysis segments the document into pages, paragraphs, tables, and form fields.
    4. An LLM maps extracted content to your target JSON schema or a default document schema.
    5. Structured JSON is returned with pages, content blocks, tables, and metadata.
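To make the last step concrete, here is one way the returned structure might be modeled. This is an illustrative guess, not the documented default schema: only `pages`, `number`, `blocks`, `type`, and `text` appear in the code example on this page, while the `tables` and `metadata` layouts and all sample values are assumptions.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class Block:
    type: str  # e.g. "paragraph"; the set of block types is an assumption
    text: str


@dataclass
class Page:
    number: int
    blocks: list[Block] = field(default_factory=list)
    # Assumed layout: each table is a list of rows, each row a list of cells.
    tables: list[list[list[str]]] = field(default_factory=list)


@dataclass
class Document:
    metadata: dict[str, str] = field(default_factory=dict)
    pages: list[Page] = field(default_factory=list)


def to_json(doc: Document) -> str:
    """Serialize the nested dataclasses to a JSON string."""
    return json.dumps(asdict(doc), indent=2)


# Made-up sample values, for illustration only.
example = Document(
    metadata={"title": "Annual Report"},
    pages=[
        Page(
            number=1,
            blocks=[Block(type="paragraph", text="Revenue grew.")],
            tables=[[["Year", "Revenue"], ["2023", "10"]]],
        )
    ],
)
```

Calling `to_json(example)` yields a JSON object with top-level `metadata` and `pages` keys. The real response may name or nest these fields differently.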

    Code Examples

    from mixpeek import Mixpeek

    client = Mixpeek(api_key="YOUR_API_KEY")

    # Convert pages 1-20 of a remote PDF to structured JSON.
    result = client.convert(
        source="https://example.com/annual-report.pdf",
        from_format="pdf",
        to_format="json",
        options={
            "ocr_fallback": True,
            "extract_tables": True,
            "extract_images": False,
            "pages": "1-20",
        },
    )

    # Print a short preview of every content block on each page.
    for page in result.pages:
        print(f"--- Page {page.number} ({len(page.blocks)} blocks) ---")
        for block in page.blocks:
            print(f" [{block.type}] {block.text[:100]}...")

    Use Cases

    Ingest PDF reports into data warehouses and analytics platforms
    Convert invoice PDFs into JSON for automated accounting workflows
    Parse legal contracts into structured clause-level JSON for review tools
    Transform product specification PDFs into JSON for e-commerce catalog imports
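For the data warehouse use case, the nested output usually has to be flattened into rows first. The sketch below assumes the JSON has already been parsed into a plain dict with `pages`, `number`, `blocks`, `type`, and `text` keys, mirroring the attribute names in the code example. `flatten_blocks` is a hypothetical helper, not a Mixpeek function.

```python
def flatten_blocks(document: dict) -> list[dict]:
    """Turn a parsed document dict into one flat row per content block.

    Assumes the shape {"pages": [{"number": int, "blocks": [{"type", "text"}]}]}.
    """
    rows = []
    for page in document.get("pages", []):
        for index, block in enumerate(page.get("blocks", [])):
            rows.append(
                {
                    "page": page.get("number"),
                    "block_index": index,  # position of the block within its page
                    "type": block.get("type"),
                    "text": block.get("text", ""),
                }
            )
    return rows
```

Each row can then be written out with `csv.DictWriter` or passed to a bulk loader, since every row shares the same four keys.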

    Supported Input Formats

    PDF

    Quick Info

    Category: document
    Max File Size: 200 MB
    Est. Time: 2-15 sec per page

    Try This Conversion

    Get started with the Mixpeek API and convert your first file in minutes.

    Ready to convert PDF to JSON?

    Start using the Mixpeek PDF to JSON converter in minutes. Sign up for a free API key and follow the documentation to get started.