Logo

Benchmarks

Measured performance across document formats

Processing Speed

Throughput comparison • Higher is better

PDF
file
MB/S SPEED
2.93
AVG COMPETITOR
0.11

FASTER

25.9x

Memory Usage

RAM consumption • Lower is better

PDF
file

Kreuzberg

553.16MB

Markitdown

178.19MB

Pymupdf4llm

350.50MB

Docling

2056.45MB
KREUZBERG
553.16MB
AVG COMPETITOR
861.71MB

LIGHTER

1.6x

Extraction Time

Pure processing time • Lower is better

PDF
file

Kreuzberg

44.20ms

Markitdown

671.58ms

Pymupdf4llm

1691.76ms

Docling

10930.72ms
KREUZBERG
44.20ms
AVG COMPETITOR
4431.35ms

FASTER

100.3x

Extraction Accuracy (TF1)

Extraction accuracy (F1 Score) (TF1) • Higher is better

PDF
file

Kreuzberg

0.970

Markitdown

1.000

Docling

0.960

Pymupdf4llm

0.960
KREUZBERG
0.97
AVG COMPETITOR
0.97

MORE ACCURATE

1.0x

Structural Accuracy (SF1)

Structure preservation (SF1) • Higher is better

PDF
file

Kreuzberg

0.300

Docling

0.610

Pymupdf4llm

0.580

Markitdown

0.340
KREUZBERG
0.30
AVG COMPETITOR
0.51

MORE ACCURATE

0.6x

Success Rate

Success rate • Higher is better

PDF
file
96.51%

Kreuzberg

100%

Pymupdf4llm

99.39%

Markitdown

95.35%

Docling

Cold Start

Framework initialization time • Lower is better

PDF
file

Kreuzberg

187ms

Pdfminer

0ms

Pdfplumber

0ms

Pdftotext

0ms

Playa-pdf

0ms

Pypdf

0ms

Tika

0ms

Pandoc

202ms

Pymupdf4llm

2747ms

Markitdown

3468ms

Docling

18614ms
KREUZBERG
187.29ms
AVG COMPETITOR
202.18ms

FASTER

1.1x

Installation Footprint

Framework installation size • Lower is better

PDF
file

Kreuzberg

N/A

Pypdf

1.34MB

Playa-pdf

2.38MB

Pdfminer

28.80MB

Pymupdf4llm

49.11MB

Pdfplumber

61.13MB

Pdftotext

76.39MB

Pandoc

189.78MB

Markitdown

195.50MB

Tika

259.40MB

Docling

2832.41MB
KREUZBERG
---MB
AVG COMPETITOR
1.34MB

SMALLER

-x

CPU Usage

Processing PDF files • Real-time Replay

PDF
file

Live CPU Usage

0.0s / 25.0s

Kreuzberg

0%

Pandoc

No data
Markitdown

0%

Docling

0%

Pymupdf4llm

0%

Average CPU usage

CPU %

Framework Capabilities

Feature support comparison across frameworks

FrameworkOCR SUPPORTBATCH PROCESSINGASYNC SUPPORT
Kreuzberg
Docling
Markitdown
Pandoc
Pdfminer
Pdfplumber
Pdftotext
Playa-pdf
Pymupdf4llm
Pypdf
Tika

Download Benchmark Data

All raw benchmark results are publicly available. Download in JSON format for your own analysis.

We value your privacy

Kreuzberg uses cookies to improve your experience, personalize content, and analyze traffic. You can manage your preferences at any time.