Document Processing Superpowers for Your AI Assistant

Connect AI agents to Documentize with MCP

The Documentize MCP server exposes all document processing capabilities as tools for AI agents and LLM clients — convert, merge, extract, sign, and more, directly from Claude Desktop, VS Code Copilot, Cursor, or any MCP-compatible host.

⚡ Endpoint

The MCP server runs over Streamable HTTP at:

https://api.documentize.app/mcp

The server uses stateful sessions. In response to the initialize request, it returns an Mcp-Session-Id header; include that header in every subsequent request. MCP clients handle this automatically.
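
For a hand-rolled client, the handshake might look like the sketch below. It assumes the server follows the standard MCP Streamable HTTP conventions (a JSON-RPC initialize request sent by POST, with the session ID returned as a response header). The helper names and the clientInfo values are illustrative and not part of Documentize.

```python
import json
import urllib.request

MCP_URL = "https://api.documentize.app/mcp"


def build_initialize_request(request_id=1):
    """Build the JSON-RPC body for the MCP initialize request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            "protocolVersion": "2025-11-25",
            "capabilities": {},
            # Hypothetical client identity, used for illustration only.
            "clientInfo": {"name": "example-client", "version": "0.1.0"},
        },
    }


def session_headers(session_id=None):
    """Headers for an MCP POST; adds Mcp-Session-Id once a session exists."""
    headers = {
        "Content-Type": "application/json",
        "Accept": "application/json, text/event-stream",
    }
    if session_id:
        headers["Mcp-Session-Id"] = session_id
    return headers


def open_session(url=MCP_URL):
    """Send initialize and return the session ID from the response header."""
    body = json.dumps(build_initialize_request()).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers=session_headers(), method="POST"
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        # A full client would also send notifications/initialized next.
        return response.headers.get("Mcp-Session-Id")
```

Every later request would then be sent with session_headers(session_id), so the header travels with each call.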

🔌 Connect from Claude Desktop

Add the server to claude_desktop_config.json (%APPDATA%\Claude\ on Windows, ~/Library/Application Support/Claude/ on macOS):

{
  "mcpServers": {
    "documentize": {
      "type": "http",
      "url": "https://api.documentize.app/mcp"
    }
  }
}

Restart Claude Desktop. The Documentize tools will appear in the tool list.

🔌 Connect from VS Code (GitHub Copilot)

Create .vscode/mcp.json in your workspace (or add to User Settings):

{
  "servers": {
    "documentize": {
      "type": "http",
      "url": "https://api.documentize.app/mcp"
    }
  }
}

Open GitHub Copilot Chat, switch to Agent mode, and click the Tools button — Documentize tools will be listed there.

🔌 Connect from Cursor / Cline

In Cursor, open Settings → MCP and add a new server:

{
  "name": "documentize",
  "type": "http",
  "serverUrl": "https://api.documentize.app/mcp"
}

Cline users can add the same entry under MCP Servers in its settings panel.

⚙️ How tasks work

All processing tools are asynchronous. Each call starts a background job and returns a JSON object with a folderName field — that is your task ID.

  1. Call a processing tool (e.g. ConvertDocument) → receive a task ID (folderName).
  2. Call GetTaskStatus with that task ID and poll until statusCode is 200 (done) or 500 (error).
  3. When complete, the response contains a sharedFiles array. Each entry has a fileName (display name) and an uploadFileName (a pre-signed S3 download URL). Open any uploadFileName URL in your browser to download the result file.

// 1. Start a task
ConvertDocument(fileUrl: "https://example.com/report.pdf",
                inputType: "pdf", outputType: "docx")
// → { "folderName": "abc123", "statusCode": 204 }

// 2. Poll until done (statusCode 204/202/203 = still processing)
GetTaskStatus(taskId: "abc123")
// → {
//     "statusCode": 200,
//     "sharedFiles": [
//       {
//         "fileName": "report.docx",
//         "uploadFileName": "https://s3.amazonaws.com/...presigned-url..."
//       }
//     ]
//   }

// 3. Download the result
// Open uploadFileName in a browser, or tell the agent:
//   "Download https://s3.amazonaws.com/...presigned-url... and save it as report.docx"

Use GetNextAppSuggestions after any operation to get recommended follow-up tools based on the output format.
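
The start, poll, download flow described above can be sketched as a small polling helper. This is a minimal sketch, not the Documentize client: get_status is a hypothetical callable that wraps whatever your MCP client uses to call GetTaskStatus, and the interval and attempt limits are arbitrary defaults.

```python
import time

# Status codes as described in this guide: 200 = done, 500 = error,
# 202/203/204 = still processing.
PROCESSING_CODES = {202, 203, 204}


def wait_for_task(get_status, task_id, interval=2.0, max_attempts=60,
                  sleep=time.sleep):
    """Poll get_status(task_id) until the task finishes.

    Returns a list of (fileName, download URL) pairs taken from sharedFiles.
    """
    for _ in range(max_attempts):
        status = get_status(task_id)
        code = status.get("statusCode")
        if code == 200:
            return [(f["fileName"], f["uploadFileName"])
                    for f in status.get("sharedFiles", [])]
        if code not in PROCESSING_CODES:
            raise RuntimeError(f"Task {task_id} failed with statusCode {code}")
        sleep(interval)
    raise TimeoutError(f"Task {task_id} still processing after {max_attempts} polls")


# Usage with a stubbed status source, so no network access is needed.
_responses = iter([
    {"statusCode": 204},
    {"statusCode": 200, "sharedFiles": [
        {"fileName": "report.docx",
         "uploadFileName": "https://example.com/presigned"},
    ]},
])
files = wait_for_task(lambda _task_id: next(_responses), "abc123",
                      sleep=lambda _seconds: None)
print(files)  # [('report.docx', 'https://example.com/presigned')]
```

Passing the sleep function as a parameter keeps the helper easy to test and lets a caller swap in a backoff strategy.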

📂 Providing your files

The MCP server accepts files in three ways — use whichever fits your workflow:

Local file path (recommended for desktop AI clients)

When running Claude Desktop, Cursor, or Cline on your own machine, simply pass the path to a file on your disk. The AI agent reads it directly — no manual upload needed.

// Windows absolute path
ConvertDocument(fileUrl: "C:\\Users\\Alice\\Documents\\report.pdf",
                inputType: "pdf", outputType: "docx")

// macOS / Linux absolute path
ConvertDocument(fileUrl: "/home/alice/documents/report.pdf",
                inputType: "pdf", outputType: "docx")

You can also use the standard file:// URI format:

// file:// URI — Windows
ConvertDocument(fileUrl: "file:///C:/Users/Alice/Documents/report.pdf",
                inputType: "pdf", outputType: "docx")

// file:// URI — macOS / Linux
ConvertDocument(fileUrl: "file:///home/alice/documents/report.pdf",
                inputType: "pdf", outputType: "docx")

Remote URL

Pass any publicly accessible https:// or http:// URL. The server fetches the file automatically before processing.

ConvertDocument(fileUrl: "https://example.com/report.pdf",
                inputType: "pdf", outputType: "docx")
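
If you build tool arguments in your own code, it can help to check which of the three source forms a string uses before sending it. The helper below is a hypothetical client-side sketch; the function name and the returned labels are not part of the Documentize API.

```python
import re
from urllib.parse import urlparse


def classify_source(source: str) -> str:
    """Return 'local_path', 'file_uri', or 'remote_url' for a file source."""
    # Windows drive paths are checked first: urlparse would otherwise
    # treat the drive letter ("C:") as a URL scheme.
    if re.match(r"^[A-Za-z]:[\\/]", source):
        return "local_path"
    scheme = urlparse(source).scheme.lower()
    if scheme in ("http", "https"):
        return "remote_url"
    if scheme == "file":
        return "file_uri"
    if source.startswith("/"):
        return "local_path"  # macOS / Linux absolute path
    raise ValueError(f"Unrecognized file source: {source!r}")


for example in (
    "C:\\Users\\Alice\\Documents\\report.pdf",
    "/home/alice/documents/report.pdf",
    "file:///C:/Users/Alice/Documents/report.pdf",
    "https://example.com/report.pdf",
):
    print(classify_source(example), example)
```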

Practical agent prompts

With a local AI client you can mention a file path in plain language and let the agent pass it to the right tool:

// In Claude Desktop or Cursor chat:
"Convert my file C:\Users\Alice\Downloads\invoice.pdf to Word format"

"Compress /home/alice/thesis.pdf and send me the download link"

"Extract all text from file:///C:/Reports/Q1.pdf"

🧰 Available Tools

Document Operations

  • ConvertDocument — Convert between PDF, DOCX, PPTX, HTML, JPG, PNG, and more.
  • MergeDocuments — Combine multiple files into a single document.
  • SplitDocument — Split a PDF by page ranges, bookmarks, or fixed size.
  • CompressDocument — Reduce PDF file size (low / medium / high).
  • RotateDocument — Rotate all or selected pages by 90°, 180°, or 270°.
  • ResizeDocument — Resize PDF pages to A4, Letter, A3, etc.
  • RemovePages — Delete specific pages from a PDF.

Security

  • LockDocument — Encrypt a PDF with a password.
  • UnlockDocument — Remove a password from a protected PDF.
  • SignDocument — Digitally sign a document.
  • VerifySignature — Verify existing digital signatures.

Content Extraction

  • ExtractText — Pull all text from a document (Pure or Raw mode).
  • ExtractImages — Export all embedded images.
  • ExtractMetadata — Read author, title, creation date, keywords, etc.
  • ExtractFormData — Export data from fillable PDF form fields.
  • ParseDocument — Parse annotations, bookmarks, tables, and more.
  • SearchInDocument — Find text occurrences inside a document.
  • MakeSearchable — Add a text layer to a scanned PDF via OCR.
  • RunOcr — Run OCR on a scanned file or image to produce a searchable PDF.

Form & Structure

  • FlattenDocument — Flatten form fields and annotations (make non-editable).
  • AddTable — Embed a table from an Excel or CSV file into a PDF.
  • AddToc — Add a table of contents to a PDF based on its headings.

AI-Powered

  • GenerateTableOfContents — AI-structured TOC from document headings.
  • GenerateAbstract — Summarize a document (academic / professional / casual style).
  • GenerateChecklist — Extract tasks, requirements, or compliance items from a document.
  • AnalyzeResume — Extract, analyze, or compare a CV against a job description.
  • ChatWithDocument — Ask questions about document content (RAG).
  • GenerateSvg — Create an SVG illustration from a text prompt.
  • AddIllustrations — Generate AI illustrations and embed them in a document.

Status & Utility

  • GetTaskStatus — Check task progress; returns download link when ready.
  • GetNextAppSuggestions — Get recommended follow-up operations for a given output.

📋 Notes

  • File sources can be a local path (C:\file.pdf, /home/user/file.pdf), a file:// URI, or a public https:// URL.
  • For MergeDocuments, pass file URLs as a comma-separated string.
  • Status codes: 202/203/204 = still processing, 200 = done, 500 = error.
  • MCP protocol version reported by the server: 2025-11-25.
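
Since MergeDocuments takes its inputs as one comma-separated string, a small helper can build that string from a list. This is a sketch under assumptions: the function name is hypothetical, and rejecting URLs that contain a literal comma is a precaution chosen here, not documented server behavior.

```python
def join_file_urls(urls):
    """Join file URLs into the comma-separated string MergeDocuments expects."""
    cleaned = [u.strip() for u in urls if u and u.strip()]
    for url in cleaned:
        if "," in url:
            # A literal comma would be ambiguous in a comma-separated list;
            # percent-encode it as %2C before passing the URL.
            raise ValueError(f"URL contains a comma: {url!r}")
    return ",".join(cleaned)


merged_input = join_file_urls([
    "https://example.com/part1.pdf",
    " https://example.com/part2.pdf ",
])
print(merged_input)
# https://example.com/part1.pdf,https://example.com/part2.pdf
```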

Quick Reference

MCP Endpoint

https://api.documentize.app/mcp

claude_desktop_config.json

{
  "mcpServers": {
    "documentize": {
      "type": "http",
      "url": "https://api.documentize.app/mcp"
    }
  }
}

.vscode/mcp.json

{
  "servers": {
    "documentize": {
      "type": "http",
      "url": "https://api.documentize.app/mcp"
    }
  }
}

Key Facts

  • Protocol: MCP 2025-11-25 (Streamable HTTP)
  • Sessions: stateful — Mcp-Session-Id header required after init
  • Tasks are async — poll GetTaskStatus; download via sharedFiles[].uploadFileName (S3 pre-signed URL)
  • File inputs accept public URLs, local paths, or file:// URIs

Connect Claude, Cursor, or any MCP-compatible client to 28+ document tools. Convert, compress, sign, extract text, run OCR, and generate AI summaries — all through natural language.

How Documentize MCP Works

1. Connect Your MCP Client

Add the Documentize MCP server to Claude Desktop, Cursor, VS Code, or any MCP-compatible client. It takes only a few lines of configuration.

2. Initialize Session

Your client automatically sends an initialize request and receives a session ID. All subsequent calls use this session for tracking (perfect for usage-based monetization).

3. Ask Naturally

"Convert this PDF to DOCX" — your AI assistant calls the appropriate tool with your document. No API documentation needed.

4. Get Results

The server processes the job asynchronously. Your AI assistant polls GetTaskStatus until the task is done, then receives the converted document or extracted data as download links.
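
Behind a natural-language request like the one in step 3, an MCP client sends a JSON-RPC tools/call message. The sketch below builds such a message using the standard MCP message shape; the argument names are taken from the ConvertDocument examples on this page, and the helper name is illustrative.

```python
import json


def build_tool_call(name, arguments, request_id=1):
    """Build a JSON-RPC tools/call request body for an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


payload = build_tool_call(
    "ConvertDocument",
    {
        "fileUrl": "https://example.com/report.pdf",
        "inputType": "pdf",
        "outputType": "docx",
    },
    request_id=2,
)
print(json.dumps(payload, indent=2))
```

In normal use your MCP client builds and sends this message for you; the sketch is mainly useful for custom integrations and debugging.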

FAQs

Model Context Protocol (MCP) is an open standard that lets AI assistants like Claude and Cursor call tools directly. Instead of writing API code, you just ask naturally. Documentize provides an MCP server with 28+ document processing tools your AI can use immediately.

Stateful mode enables per-session tracking, usage metering, and client isolation, all of which are essential for monetization. Each client gets a unique session ID after initialize, which you can use to bill by usage, enforce rate limits, or track customer activity. Your AI client handles the session ID automatically.

Yes, local files are supported. Use file:// URIs (e.g., file:///home/user/document.pdf). Your MCP client sends the file path, and Documentize reads it directly from your local filesystem. Perfect for sensitive documents that shouldn't be uploaded to the cloud.

Supported clients include Claude Desktop (with Agent Mode), Cursor IDE, VS Code with Copilot, Continue.dev, and any other MCP-compatible client. We also provide direct HTTP access for custom integrations.

Documentize runs on your own infrastructure when self-hosted, or on our EU-based servers in the cloud version. For the MCP server, you choose: self-host for complete data control, or use our cloud API for convenience. No document data is ever used to train AI models.

Documentize provides the processing engine. You add authentication middleware, usage tracking, and billing. The stateful session model gives you Mcp-Session-Id headers you can correlate to API keys or user accounts. We offer licensing for commercial redistribution.
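
As an illustration of correlating session IDs with accounts, the sketch below keeps an in-memory usage counter per account. Everything in it is hypothetical: the class, its method names, and the quota handling are assumptions for illustration, and a real deployment would persist counts and reset them each billing period.

```python
class UsageMeter:
    """Count operations per account, looked up through the MCP session ID."""

    def __init__(self, monthly_quota=50):
        self.monthly_quota = monthly_quota
        self._account_by_session = {}
        self._used = {}

    def bind(self, session_id, account_id):
        """Associate a session ID (from Mcp-Session-Id) with an account."""
        self._account_by_session[session_id] = account_id

    def record(self, session_id):
        """Record one operation; return False if the quota is exhausted."""
        account = self._account_by_session.get(session_id)
        if account is None:
            raise KeyError(f"Unknown session: {session_id}")
        used = self._used.get(account, 0)
        if used >= self.monthly_quota:
            return False
        self._used[account] = used + 1
        return True

    def used(self, account_id):
        """Return how many operations the account has consumed."""
        return self._used.get(account_id, 0)


meter = UsageMeter(monthly_quota=2)
meter.bind("session-abc", "account-1")
results = [meter.record("session-abc") for _ in range(3)]
print(results, meter.used("account-1"))  # [True, True, False] 2
```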

Unlock the Power of Document Management

Depth-Layered SVG Architecture

Each SVG is composed of Background, Midground, and Foreground layers with Z-index ordering and dependency tracking. The AI manages layer relationships and removes underperforming layers automatically to keep the output clean and coherent.

Iterative SVG Editing

Improve and modify SVG designs through continuous AI chat.

Fast Vector Creation

Create icons, logos, and illustrations in seconds.

Simple, Usage-Based Pricing

Free Tier

$0
  • 50 document operations/month