Autype Lens

AI-powered document understanding

Autype Lens goes beyond plain OCR. It combines optical character recognition with vision-language models to extract text, layout, styling, and structure from any document.

Document Input

PDF, DOCX, ODT

Autype Lens

Extract content

Detect layout

Identify styling

Classify & extract

Structured Output

md / mdd / json

Two AI technologies, one pipeline

Traditional OCR captures text but loses everything else. Vision-language models understand pages visually but struggle with precision. Autype Lens combines both into a single pipeline (built on fine-tuned open-source models) to deliver results neither approach could achieve alone.

Optical Character Recognition

Precision text extraction at character level. Our fine-tuned OCR models handle complex layouts, multi-column pages, and embedded tables with high accuracy, including scanned documents.

Vision-Language Models

Page-level visual comprehension. Fine-tuned VLMs analyze the full page as an image to understand heading hierarchies, font styles, color schemes, margins, and spatial relationships between elements.

Unified extraction pipeline

The OCR pass delivers precise text. The VLM pass delivers document semantics. Our pipeline merges both, validates output against JSON schemas, and retries automatically if needed.

Built on fine-tuned open-source models

Output validated against JSON schemas

Automatic retry with error feedback
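
The validate-and-retry step described above can be sketched roughly as follows. This is a minimal illustration, not the Lens implementation: the `validate` and `run_with_retry` helpers, the type-map "schema", and the `fake_model` stand-in are all hypothetical names introduced here.

```python
from typing import Any, Callable

# Hypothetical mini-schema: field name -> expected Python type.
Schema = dict[str, type]


def validate(output: dict[str, Any], schema: Schema) -> list[str]:
    """Return a list of human-readable problems; an empty list means valid."""
    errors = []
    for field, expected in schema.items():
        if field not in output:
            errors.append(f"missing field: {field}")
        elif not isinstance(output[field], expected):
            errors.append(f"{field}: expected {expected.__name__}")
    return errors


def run_with_retry(
    generate: Callable[[list[str]], dict[str, Any]],
    schema: Schema,
    max_attempts: int = 3,
) -> dict[str, Any]:
    """Call generate() until its output validates, feeding errors back in."""
    feedback: list[str] = []
    for _ in range(max_attempts):
        output = generate(feedback)
        feedback = validate(output, schema)
        if not feedback:
            return output
    raise ValueError(f"still invalid after {max_attempts} attempts: {feedback}")


def fake_model(feedback: list[str]) -> dict[str, Any]:
    """Stand-in for a model call: omits 'total' until it receives feedback."""
    if not feedback:
        return {"vendor": "Acme Corp"}
    return {"vendor": "Acme Corp", "total": 4250.0}


result = run_with_retry(fake_model, {"vendor": str, "total": float})
print(result)
```

Passing the validation errors back into the next attempt is what "retry with error feedback" means in this sketch; a plain retry without feedback would call `generate` with the same input each time.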

Four ways to understand documents

Each operation targets a different document understanding task. All are accessible through the same REST API.

Acme Corp

Quarterly Report

Quarterly Report Q4

Revenue grew by 23% compared to last quarter, driven by enterprise adoption and new product launches across all regions.

● Onboarded 18 new clients
● Retention rate at 94%
● 3 new markets planned for Q1

chart-q4.png

Key Metrics

| Metric  | Value | Change |
|---------|-------|--------|
| Revenue | €1.2M | +23%   |
| Clients | 142   | +18    |
| Margin  | 18%   | +3%    |

International expansion is on track with 3 new markets planned for Q1 2025. Operating margins improved to 18%.

Page 1 / 3

Extracted output

# Quarterly Report Q4

Revenue grew by **23%** compared to
last quarter, driven by enterprise
adoption across all regions.

Key achievements this quarter:

- Onboarded **18 new clients**
- Retention rate at 94%
- 3 new markets planned for Q1

![Q4 Chart](/output/chart-q4.png)

## Key Metrics

:::table{headerBg=#f0f0f0 rowAltBg=#fafafa}
| Metric   | Value  | Change |
|----------|--------|--------|
| Revenue  | €1.2M  | +23%   |
| Clients  | 142    | +18    |
| Margin   | 18%    | +3%    |
:::

Text, headings, tables, and structure preserved as Markdown or JSON.

OCR

Smart OCR

Convert documents to Markdown, Autype Extended Markdown (mdd, with styling and defaults), or complete Autype Document JSON. All formats are ready to edit or re-render. Page selection is supported.

Sterling & Associates

Confidential

Service Agreement

This Service Agreement (“Agreement”) is entered into as of March 15, 2025, by and between Sterling & Associates (“Provider”) and Globex Industries (“Client”).

1. Scope of Services

The Provider shall deliver consulting services as described in Exhibit A, including strategic planning, market analysis, and quarterly reviews.

2. Compensation

The Client agrees to pay a monthly retainer of €8,500 for the duration of this Agreement, due within 30 days of invoice.

3. Term & Termination

This Agreement shall commence on the date above and continue for a period of 12 months, unless terminated by either party with 30 days written notice.

Sterling & Associates · Page 1 of 4

Request

{
  "fileId": "550e8400-...",
  "labels": [
    "contract", "invoice",
    "report", "letter"
  ]
}

Classification result

{
  "category": "contract",
  "confidence": 0.95,
  "labels": [
    { "name": "contract", "score": 0.95 },
    { "name": "letter", "score": 0.03 },
    { "name": "report", "score": 0.02 }
  ]
}

Returns the matched category with a confidence score.

Classify

Document classification

Automatically categorize documents into your custom categories. Upload a document, provide your labels, and get back the best match with a confidence score.
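
One way a caller might act on the classification result is sketched below. The response body is copied from the example above; the `route` helper, the 0.8 threshold, and the `manual-review` fallback are illustrative assumptions, not Lens defaults.

```python
import json

# Response body copied from the classification example above.
response = json.loads("""
{
  "category": "contract",
  "confidence": 0.95,
  "labels": [
    { "name": "contract", "score": 0.95 },
    { "name": "letter", "score": 0.03 },
    { "name": "report", "score": 0.02 }
  ]
}
""")


def route(result: dict, threshold: float = 0.8) -> str:
    """Return the matched category, or 'manual-review' if confidence is low.

    The threshold and the fallback bucket are assumptions for illustration.
    """
    if result["confidence"] >= threshold:
        return result["category"]
    return "manual-review"


print(route(response))                  # contract
print(route(response, threshold=0.99))  # manual-review
```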

Acme Corp

123 Business St, Munich

INVOICE

INV-2025-0042

Bill to:

Globex Industries

Berlin, Germany

Date: 2025-03-15

Due: 2025-04-15

| Description         | Qty | Amount    |
|---------------------|-----|-----------|
| Consulting Services | 40h | €3,500.00 |
| Travel Expenses     | 1   | €750.00   |
| Total               |     | €4,250.00 |

All amounts in EUR

Payment terms: 30 days net · IBAN: DE89 3704 0044 0532 0130 00

Acme Corp · VAT ID: DE123456789

Request

{
  "fileId": "550e8400-...",
  "schema": {
    "invoiceNumber": "string",
    "date": "string",
    "vendor": "string",
    "total": "number",
    "currency": "string",
    "lineItems": "array"
  }
}

Extracted fields

{
  "invoiceNumber": "INV-2025-0042",
  "date": "2025-03-15",
  "vendor": "Acme Corp",
  "total": 4250.00,
  "currency": "EUR",
  "lineItems": [
    { "description": "Consulting", "amount": 3500 },
    { "description": "Expenses", "amount": 750 }
  ]
}

Structured JSON with every field from your schema filled in.

Extract

Structured data extraction

Define a field schema and let Lens pull structured data from any document. Works for invoice numbers, dates, names, amounts, and anything else your workflow needs.
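
Extracted fields can be sanity-checked before they enter downstream systems. The sketch below uses the extracted fields from the example above and checks that the line items add up to the total (3,500 + 750 = 4,250). The `line_items_match_total` helper is a hypothetical name, and this check is a suggestion rather than part of Lens.

```python
from decimal import Decimal

# Extracted fields copied from the example above.
extracted = {
    "invoiceNumber": "INV-2025-0042",
    "date": "2025-03-15",
    "vendor": "Acme Corp",
    "total": 4250.00,
    "currency": "EUR",
    "lineItems": [
        {"description": "Consulting", "amount": 3500},
        {"description": "Expenses", "amount": 750},
    ],
}


def line_items_match_total(fields: dict) -> bool:
    """Check that the line item amounts add up to the invoice total."""
    # Decimal(str(...)) avoids binary floating-point drift on currency values.
    item_sum = sum(Decimal(str(item["amount"])) for item in fields["lineItems"])
    return item_sum == Decimal(str(fields["total"]))


print(line_items_match_total(extracted))  # True
```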

35 files

Acme Corp

123 Business St, Munich

INVOICE

INV-2025-0042

Bill to:

Globex Industries

Berlin, Germany

Date: 2025-03-15

Due: 2025-04-15

| Description         | Qty | Amount    |
|---------------------|-----|-----------|
| Consulting Services | 40h | €3,500.00 |
| Travel Expenses     | 1   | €750.00   |
| Total               |     | €4,250.00 |

Payment terms: 30 days net · IBAN: DE89 3704 0044 0532 0130 00

Acme Corp · VAT ID: DE123456789

Request

{
  "fileId": "550e8400-...",
  "filenameSchema": "invoice-{number}_{date}"
}

Generated filenames

[
  { "fileId": "550e8400-...", "filename": "invoice-INV-2025-0042_2025-03-15" },
  { "fileId": "7a3b1c90-...", "filename": "invoice-INV-2025-0039_2025-03-10" },
  { "fileId": "b2f4e8d1-...", "filename": "invoice-INV-2025-0038_2025-03-08" },
  { "fileId": "c9d5a6f2-...", "filename": "invoice-INV-2025-0035_2025-03-01" },
  { "fileId": "e1a7b3c4-...", "filename": "invoice-INV-2025-0033_2025-02-28" },
  { "fileId": "f8c2d9e5-...", "filename": "invoice-INV-2025-0031_2025-02-25" },
  { "fileId": "a4b6c8d0-...", "filename": "invoice-INV-2025-0029_2025-02-20" },
  { "fileId": "d3e5f7a9-...", "filename": "invoice-INV-2025-0027_2025-02-15" },
  { "fileId": "1b2c3d4e-...", "filename": "invoice-INV-2025-0024_2025-02-10" },
  { "fileId": "5f6a7b8c-...", "filename": "invoice-INV-2025-0021_2025-02-05" },
  ...  // 25 more results
]

Pattern placeholders replaced with values read from the document.

Naming

AI filename generation

Provide a naming pattern like invoice-{number}_{date} and Lens reads the document to fill in the placeholders. Automate your file organization.
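
The substitution step itself is simple once the values are known. The sketch below fills the pattern from the request above with values as they might be read from the sample invoice. The `fill_pattern` helper and the `number` and `date` value keys are assumptions for illustration; how Lens maps placeholders to document content is not specified here.

```python
import re

# Matches {name} placeholders made of letters, digits, or underscores.
PLACEHOLDER = re.compile(r"\{(\w+)\}")


def fill_pattern(pattern: str, values: dict[str, str]) -> str:
    """Replace each {name} placeholder with the matching extracted value."""

    def substitute(match: re.Match) -> str:
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value extracted for placeholder {{{name}}}")
        return values[name]

    return PLACEHOLDER.sub(substitute, pattern)


# Values as they might be read from the sample invoice (assumed keys).
values = {"number": "INV-2025-0042", "date": "2025-03-15"}
print(fill_pattern("invoice-{number}_{date}", values))
# invoice-INV-2025-0042_2025-03-15
```

Raising on an unknown placeholder, rather than leaving it in the filename, keeps half-filled names such as `invoice-{number}_2025-03-15` out of the file system.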

Images included in extraction

When using mdd or JSON output, Lens detects and extracts all embedded images from the document. Every image gets a download URL so you can use them in your pipeline or re-render them in a new document. No extra step required.

image-001.png

Three output levels

Choose the depth of extraction you need. From raw text to a fully styled, renderable document.

md

Standard Markdown

Raw text extraction as clean Markdown. Fast, lightweight, and ideal for search indexing or content migration.

# Quarterly Report

Revenue grew by **23%**
compared to last quarter.

## Key Metrics

| Metric   | Value |
|----------|-------|
| Revenue  | €1.2M |
| Growth   | 23%   |

mdd

Autype Extended Markdown

Markdown plus full document settings, styling defaults, headers, and footers. Re-render with the original look.

---document
size: A4
marginTop: 2.5
marginBottom: 2
---defaults
fontFamily: Inter
fontSize: 11
color: #333333
---
# Quarterly Report

Revenue grew by **23%**
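
For programmatic use, the settings blocks can be separated from the Markdown body. The parser below is a rough sketch inferred only from the sample above: it assumes blocks open with `---name`, hold simple `key: value` lines, and end at a bare `---`. The real mdd grammar may allow more than this, and `parse_mdd` is a hypothetical helper.

```python
SAMPLE = """\
---document
size: A4
marginTop: 2.5
marginBottom: 2
---defaults
fontFamily: Inter
fontSize: 11
color: #333333
---
# Quarterly Report

Revenue grew by **23%**
"""


def parse_mdd(text: str) -> tuple[dict[str, dict[str, str]], str]:
    """Split mdd text into named settings blocks and the Markdown body."""
    lines = text.splitlines()
    # No settings header at all: treat the whole text as Markdown.
    if not lines or not lines[0].startswith("---"):
        return {}, text

    blocks: dict[str, dict[str, str]] = {}
    current = None
    for index, line in enumerate(lines):
        if line == "---":
            # A bare '---' closes the settings area; the rest is Markdown.
            return blocks, "\n".join(lines[index + 1:])
        if line.startswith("---"):
            current = blocks.setdefault(line[3:].strip(), {})
        elif current is not None and ":" in line:
            key, _, value = line.partition(":")
            current[key.strip()] = value.strip()
    # Settings area never closed: there is no body to return.
    return blocks, ""


settings, body = parse_mdd(SAMPLE)
print(settings["document"]["size"])      # A4
print(settings["defaults"]["fontSize"])  # 11
```

Values are kept as strings here; converting `2.5` or `11` to numbers is left to the caller because the sample does not say which keys are numeric.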

json

Autype Document JSON

Complete structured document with sections, elements, and styling. Ready to import into Autype or process programmatically.

{
  "defaults": {
    "fontFamily": "Inter",
    "fontSize": 11
  },
  "sections": [{
    "type": "flow",
    "content": [
      { "type": "h1", "text": "Report" }
    ]
  }]
}
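
Processing the JSON output programmatically can look like the sketch below, which walks the sample structure above and collects its headings. Only `h1` appears in the sample; treating `h2` through `h6` as analogous element types is an assumption, and `headings` is a hypothetical helper.

```python
import json

# Document copied from the JSON example above.
document = json.loads("""
{
  "defaults": {"fontFamily": "Inter", "fontSize": 11},
  "sections": [{
    "type": "flow",
    "content": [
      {"type": "h1", "text": "Report"}
    ]
  }]
}
""")


def headings(doc: dict) -> list[tuple[int, str]]:
    """Collect (level, text) for every h1..h6 element, in document order."""
    found = []
    for section in doc.get("sections", []):
        for element in section.get("content", []):
            kind = element.get("type", "")
            # Assumes heading types are named h1..h6, as h1 is in the sample.
            if len(kind) == 2 and kind[0] == "h" and kind[1] in "123456":
                found.append((int(kind[1]), element.get("text", "")))
    return found


print(headings(document))  # [(1, 'Report')]
```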

Built for real workflows

Autype Lens fits into document pipelines where raw OCR isn't enough.

Document digitization

Convert scanned PDFs and legacy documents into editable, structured formats. The original layout and design are preserved.

Content migration

Move documents between systems without losing formatting. Extract styled content and re-render it in Autype or any other platform.

Automated filing

Classify incoming documents, extract key fields, and generate filenames automatically. Build hands-free document intake pipelines.

Data extraction

Pull structured data from invoices, contracts, reports, and forms. Define your schema once and extract at scale via the API.

One API call away

Integrate Lens into your workflow with a simple REST API call. Upload a file, choose your output format, and get structured results.

Request

curl -X POST https://api.autype.com/api/v1/dev/tools/lens/ocr \
  -H "X-API-Key: your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "fileId": "550e8400-e29b-41d4-a716-446655440000",
    "outputFormat": "mdd"
  }'

Response

{
  "id": "job_abc123",
  "status": "COMPLETED",
  "result": {
    "outputFormat": "mdd",
    "content": "---document\nsize: A4\n---defaults\nfontFamily: Inter\n---\n# Report Title\n\nContent with **styling** preserved..."
  }
}
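
The same call can be made from Python with only the standard library. The sketch below mirrors the curl request and response shown above; the `build_ocr_request` and `run_ocr` helpers are hypothetical names, and the handling of any status other than `COMPLETED` is an assumption, since only the completed case is shown here.

```python
import json
import urllib.request

API_URL = "https://api.autype.com/api/v1/dev/tools/lens/ocr"


def build_ocr_request(
    api_key: str, file_id: str, output_format: str = "mdd"
) -> urllib.request.Request:
    """Build the same POST request as the curl example, without sending it."""
    body = json.dumps({"fileId": file_id, "outputFormat": output_format})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        headers={"X-API-Key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def run_ocr(api_key: str, file_id: str, output_format: str = "mdd") -> str:
    """Send the request and return result.content from the JSON response."""
    request = build_ocr_request(api_key, file_id, output_format)
    with urllib.request.urlopen(request, timeout=60) as response:
        payload = json.load(response)
    # Assumption: anything other than COMPLETED is treated as a failure here.
    if payload.get("status") != "COMPLETED":
        raise RuntimeError(f"job {payload.get('id')} is {payload.get('status')}")
    return payload["result"]["content"]


# Building the request has no side effects; run_ocr() performs the network call.
request = build_ocr_request("your_api_key", "550e8400-e29b-41d4-a716-446655440000")
print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without an API key.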

Available on all plans. 4 credits per page.

Ready to go beyond OCR?

Start extracting text, layout, and styling from your documents today.
