How is image-to-searchable-text different from image-to-text?

Image-to-text extracts raw text content from images. Image-to-searchable-text goes further by preserving document structure (paragraphs, columns, headers), maintaining reading order, and normalizing the output specifically for full-text search indexing. It also applies heavier preprocessing like deskew and binarization to maximize OCR accuracy on document images.

Does it work with photographed documents that are tilted or skewed?

Yes. The preprocessing pipeline includes automatic deskew detection and correction. Images rotated up to 45 degrees are automatically straightened. For perspective distortion (e.g., photographed at an angle), set `perspective_correction` to true.

Can I use this to create searchable PDFs from scanned images?

Yes. Set `output_format` to 'searchable_pdf' and the API will return a PDF with an invisible text layer overlaid on the original image. This is the standard format for making scanned documents searchable while preserving visual fidelity.

What languages are supported for OCR in document images?

Over 100 languages and scripts are supported, including Latin, Cyrillic, CJK, Arabic, Devanagari, Thai, and Korean. Multi-language documents are detected and processed automatically. You can provide a `language_hint` parameter to improve accuracy for known languages.

document

Image
Searchable Text
Converter

Convert images containing text into fully searchable, indexed content using advanced OCR combined with layout understanding. Preserves document structure including paragraphs, columns, headers, and reading order for downstream search and retrieval.

Max file size: 50 MB

Estimated: 1-5 sec per image

5 input formats

How It Works

Upload an image or provide a URL to the Mixpeek API.

The image is preprocessed with deskew, binarization, and contrast enhancement.

Layout analysis detects columns, paragraphs, headers, and reading order.

OCR extracts character-level text with confidence scores per word.

Structured, search-ready text is returned with preserved reading order and optional positional metadata.

Code Examples

from mixpeek import Mixpeek

client = Mixpeek(api_key="YOUR_API_KEY")

result = client.convert(
    source="https://example.com/scanned-contract.png",
    from_format="image",
    to_format="searchable-text",
    options={
        "deskew": True,
        "preserve_layout": True,
        "language_hint": "en",
        "output_format": "structured"
    }
)

for block in result.text_blocks:
    print(f"[{block.type}] {block.text}")
print(f"Full text: {result.full_text}")

Use Cases

Digitize scanned documents and make them full-text searchable

Extract text from photographed whiteboards and handwritten notes

Process scanned receipts and invoices for accounting systems

Convert historical document archives into searchable digital collections

Supported Input Formats

JPEG

PNG

WebP

TIFF

BMP

Quick Info

Categorydocument

Max File Size50 MB

Est. Time1-5 sec per image

Extractorimage-descriptor

Try This Conversion

Get started with the Mixpeek API and convert your first file in minutes.

Frequently Asked Questions

Related Converters

Image

Text

Image to Text

Extract all readable text from images using advanced OCR combined with a vision-language model. Handles printed text, handwriting, complex layouts, receipts, signs, and multi-language documents.

Image

Embeddings

Image to Embeddings

Convert images into dense vector representations using state-of-the-art vision models. Embeddings capture semantic visual features and can be used for similarity search, clustering, and cross-modal retrieval.

PDF

Text

PDF to Text

Extract clean, structured text from PDF documents including scanned pages, multi-column layouts, headers/footers, and tables. Combines traditional parsing with OCR and layout analysis for maximum accuracy.

PDF

JSON

PDF to Structured Data

Extract structured key-value pairs, tables, and form fields from PDF documents. Uses layout analysis and LLM extraction to produce clean JSON output, even from complex forms and invoices.

Ready to convert image to searchable text?

Start using the Mixpeek Image to Searchable Text in minutes. Sign up for a free API key and follow the documentation to get started.

ImageSearchable TextConverter