Skip to content
DE
Doc Extract
FeaturesFormatsHow It WorksFAQContact
Supports 100+ file formats

Extract Text from Any Document, Instantly

Upload files or paste URLs. Get structured text, metadata, and tables from 100+ formats with OCR and audio transcription. Self-hosted and 100% private.

.pdf.docx.xlsx.pptx.epub.mp3.mp4.png.jpg.svg.csv.json.html.md.wav.flac.mkv.ogg.doc.xls.pdf.docx.xlsx.pptx.epub.mp3.mp4.png.jpg.svg.csv.json.html.md.wav.flac.mkv.ogg.doc.xls
.ppt.rtf.odt.eml.txt.xml.yaml.py.ts.go.rs.flv.3gp.mpeg.ac3.ape.caf.m4v.ogv.wv.ppt.rtf.odt.eml.txt.xml.yaml.py.ts.go.rs.flv.3gp.mpeg.ac3.ape.caf.m4v.ogv.wv
Formats
100+
Code Languages
40+
STT Languages
100
Categories
18
Tesseract OCR
OpenAI Whisper
Self-Hosted
100% Private

Powerful Extraction Features

Everything you need to extract, transcribe, and structure content from any file type.

Universal Format Support

Extract text from 100+ file formats across 18 categories — documents, spreadsheets, presentations, emails, ebooks, images, and more.

OCR Extraction

Powered by Tesseract OCR with support for 21+ languages. Extract text from scanned PDFs, photographs, and embedded images automatically.

Audio & Video Transcription

Transcribe audio and video files with OpenAI Whisper. Supports 100 languages with automatic language detection.

Async Job Processing

Submit large files as background jobs and track progress in real-time. Get notified via webhooks when processing completes.

Smart Caching

MD5 deduplication gives instant results for previously processed files.

Three-Layer Validation

Extension, MIME type, and magic bytes verification on every upload.

Table Extraction

Extract structured tables from PDFs, spreadsheets, and presentations.

Metadata Extraction

Get author, title, dates, page count, and format metadata automatically.

Multi-Language

21+ OCR languages and 100 transcription languages supported.

Self-Hosted & Private

Your data never leaves your infrastructure. Fully self-hostable.

URL & Webpage Extraction

Extract text from any URL or webpage — just paste the link.

Production Ready

Rate limiting, circuit breakers, and comprehensive error handling built in.

REST API

Full REST API with interactive OpenAPI docs at /docs and /redoc.

Webhook Callbacks

HMAC-signed webhook notifications when async jobs complete.

AI-Powered Analysis

Summarize, extract entities, ask questions, and translate with LLM.

Text-to-Speech

Browser-based TTS playback with voice selection.

Full-Text Search

PostgreSQL-powered ranked full-text search across extractions.

How It Works

From upload to structured output in six steps.

1

Upload

File or URL
2

Validate

Three-layer check
3

Detect

Auto format routing
4

Extract

Text from structure
5

Enhance

OCR, STT, tables, metadata
6

Deliver

Structured JSON

Supported Formats

100+ file formats across 18 categories. Browse by type below.

PDFOCR
.pdf
Documents
WordOCR
.docx
Documents
Word (Legacy)OCR
.doc
Documents
RTF
.rtf
Documents
ODT
.odt
Documents
ExcelOCR
.xlsx
Spreadsheets
Excel (Legacy)OCR
.xls
Spreadsheets
ODS
.ods
Spreadsheets
PowerPointOCR
.pptx
Presentations
PowerPoint (Legacy)OCR
.ppt
Presentations
ODP
.odp
Presentations
XPS/OXPS
.xps.oxps
Document

Structured Output

Every extraction returns clean, structured JSON — ready for your pipeline.

quarterly-report.pdf
2.4 MB · PDF Document
OCRTablesMetadata
response.json
{
  "filename": "quarterly-report.pdf",
  "text": "Q4 2024 Financial Summary...",
  "metadata": {
    "author": "Finance Team",
    "pages": 24,
    "created": "2024-12-15T09:30:00Z"
  },
  "tables": [
    { "title": "Revenue by Region", "rows": 12 }
  ],
  "processing_ms": 847
}

Frequently Asked Questions

Common questions about Doc Extract.

We'd Love to Hear from You

Whether you have a question, need a hand getting started, or want to explore what's possible — our team is here and happy to help.

Help & Support

Stuck on something? We'll walk you through it. From setup questions to troubleshooting, no question is too small.

AI Development

Building something with AI? Let's talk. We love collaborating on intelligent document workflows and custom extraction pipelines.

API Integration

Need to plug Doc Extract into your stack? We'll help you design a seamless integration that fits your architecture.

We're a small, passionate team that genuinely enjoys helping people build great things. Drop us a line anytime — we read every message and typically reply within a few hours.

support@apidly.comteam@apidly.com

Ready to extract text from anything?

100+ formats, OCR, audio transcription, structured output. Self-hosted and private. Get started in seconds.

DE
Doc Extract — Document extraction platform
Built with Next.js, Tesseract & Whisper