New: Full API Access available!

Turn PDF tables into clean data.

In seconds.

Upload a PDF and select pages. Let the magic happen. Download everything as XLSX, CSV, or JSON. Built for both no-code workflows and API-first automation.

Try it now API Documentation

Conversion from PDF zu Excel, CSV and JSON

Try pdftables now.

Upload a PDF, choose up to 5 pages, extract all tables, and download clean outputs for spreadsheets and pipelines.

Upload a PDF

Enter pages to analyze

5 pages to analyze

Your document is being analysed. While you wait, you can enter your email address to ensure the results are assigned correctly.

A token is sent to your email address. Please enter this token below to verify your address.

Thank you! The extraction job is running.

View results

More personalized than ever

Define your own target format

Stop manually adding, deleting and swapping rows and columns. Upload your bank statements, credit card bills, or invoices as PDF, create your own structure and export to exactly the format you need.

AI-powered column mapping: AI maps columns of PDF tables directly to your custom format.
Content mapping: Automatic translation of content of columns to your defined values.
Predefined templates: Use one of our templates for DATEV, Xero, QuickBooks, and many more or create your own.

Customize your data

DATEV_EXTF_Buchungsstapel.csv

Umsatz	Soll/Haben	BU-Schlüssel	Belegdatum	Konto
1250,00	S	9	1503	8400
45,99	H	9	1603	4900
3200,50	S	8	1803	8300
89,70	H	9	2103	4930
560,00	S	9	2203	8400
149,99	H	9	2303	4950
980,25	S	8	2403	8300
72,10	H	9	2503	4925
430,00	S	9	2603	8400
215,60	H	9	2703	4980
1750,00	S	8	2803	8300

Core features

Precise table extraction like enterprise software and easy to use as a simple tool.

Accurate table detection

Designed for tricky layouts, including multi-row and complex headers, so extracted data stays structured.

Page selection & control

Process only the pages you need and avoid unnecessary extraction noise from the rest of the document.

Multiple exports

Get XLSX, CSV, and JSON outputs from one run and choose the best format per downstream consumer.

Job-based tracking

Every extraction is tracked with status and metadata, making reruns and audits reproducible.

API-first automation

Integrate extraction directly into ETL and internal workflows through simple, predictable endpoints.

Optional OCR for scanned documents

Enable OCR-assisted extraction when source PDFs are scanned and text layers are unavailable.

API for automation

This is all you need: Upload a file, receive a job_id, poll status, and download extracted tables in the format you need.

POST /v1/upload
GET /v1/extraction/{job_id}
GET /v1/download/{table_id}?format=xlsx|csv|json

View API documentation

How it works

Drop a file!

Drag a PDF document and drop it in the selected area. The upload starts immediately.

Select pages

Select those pages you wish to extract tables from. Just like you may know from apps as Microsoft Word (1,2,5-7,11,...)

Start the job

Just one click to start the job. You don't have to wait here. The job runs in background.

View results

After the job is finished, you can access the result data to preview, select and download the tables.

Table preview

View the table preview before downloading for more efficient work.

Mass download all data

Or you can select all tables you whish and download a zip containing all files.

Who is pdftables.io for?

Tailored solutions for yout industry needs

Finance & Controlling

Convert reporting tables from PDFs into clean datasets for monthly close and analysis.

Accounting / Tax / Audit

Extract document-based tables faster while keeping records consistent for checks and reviews.

Procurement & Operations

Capture supplier and operations data from PDF documents without manual copy/paste.

Data & BI teams

Feed standardized extraction outputs into dashboards, data models, and ETL jobs.

Developers / SaaS builders

Embed table extraction with API endpoints to automate ingestion in your product stack.

Individuals & freelancers

Handle one-off client PDFs quickly and export directly into your preferred format.

Benefits at a glance

Designed for easy usage combined with powerful performance

Save time vs. manual copy/paste across recurring PDF workflows
Reduce extraction errors by standardizing outputs per table and job
Standardize downstream data for spreadsheets, BI tools, and APIs
Automate recurring workflows through predictable job and download endpoints
Scale from one-off files to batch-style operational processing

checkmark Security & privacy: Files processed securely. Data retention configurable.

Pricing

Start free. Upgrade when you need more.

Free

$0 forever

High quality extraction
OCR (for scanned documents)
Data preview
XLS, CSV and JSON download
Up to 5 pages/month
Up to 10 MB per file
No API access
No encrypted files supported

Pro

$19.90 / month
or
$149 / year

High quality extraction
OCR (for scanned documents)
Data preview
XLS, CSV and JSON download
Up to 500 pages/month
Up to 50 MB per file
API access enabled
No encrypted files supported

Team

$89 / month
or
$999 / year

High quality extraction
OCR (for scanned documents)
Data preview
XLS, CSV and JSON download
Up to 5000 pages/month
Up to 500 MB per file
API access enabled
Encrypted files supported

Need more pages? Feature request? Contact us.

FAQ

What does pdftables.io do?

pdftables.io extracts tables from PDF documents and converts them into structured data formats like Excel, CSV, and JSON, eliminating manual copy-paste work.

What types of PDFs are supported?

Both digital PDFs and scanned documents are supported. For scanned files, OCR is automatically used to detect and extract table content.

How accurate is the table detection?

The system is optimized for complex layouts, including multi-row headers, merged cells, and irregular table structures, ensuring highly structured outputs.

Can I choose which pages to process?

Yes. You can specify exact page ranges to extract tables only from relevant sections of a document.

Which export formats are available?

Extracted data can be downloaded as XLSX, CSV, or JSON, making it easy to integrate into different workflows.

Is there an API available?

Yes. pdftables.io provides a REST API so you can automate extraction and integrate it directly into your applications or data pipelines.

How long does extraction take?

Most jobs complete within seconds, depending on file size, number of pages, and table complexity.

Is my data secure?

Files are processed securely and stored only as long as needed for extraction and download, following standard data protection practices.

Do I need to install anything?

No installation is required. pdftables.io runs entirely in the browser as a cloud-based SaaS.

Can I process large volumes of PDFs?

Yes. The platform is designed to handle both single documents and large-scale batch processing through the API.

Turn PDF tables into clean data — now.

Try it now for free Read API docs