New: Full API Access available!

Turn PDF tables into clean data.

In seconds.

Upload a PDF and select pages. Let the magic happen. Download everything as XLSX, CSV, or JSON. Built for both no-code workflows and API-first automation.

Conversion from PDF zu Excel, CSV and JSON

Try pdftables now.

Upload a PDF, choose up to 5 pages, extract all tables, and download clean outputs for spreadsheets and pipelines.

Upload a PDF
arrow-right-icon

5 pages to analyze

arrow-right-icon
loading indicator

Your document is being analysed. While you wait, you can enter your email address to ensure the results are assigned correctly.

A token is sent to your email address. Please enter this token below to verify your address.

Thank you! The extraction job is running.

View results
More personalized than ever

Define your own target format

Stop manually adding, deleting and swapping rows and columns. Upload your bank statements, credit card bills, or invoices as PDF, create your own structure and export to exactly the format you need.

  • AI-powered column mapping: AI maps columns of PDF tables directly to your custom format.
  • Content mapping: Automatic translation of content of columns to your defined values.
  • Predefined templates: Use one of our templates for DATEV, Xero, QuickBooks, and many more or create your own.
Customize your data
DATEV_EXTF_Buchungsstapel.csv
Umsatz Soll/Haben BU-Schlüssel Belegdatum Konto
1250,00 S 9 1503 8400
45,99 H 9 1603 4900
3200,50 S 8 1803 8300
89,70 H 9 2103 4930
560,00 S 9 2203 8400
149,99 H 9 2303 4950
980,25 S 8 2403 8300
72,10 H 9 2503 4925
430,00 S 9 2603 8400
215,60 H 9 2703 4980
1750,00 S 8 2803 8300

Core features

Precise table extraction like enterprise software and easy to use as a simple tool.

Accurate table detection

Accurate table detection

Designed for tricky layouts, including multi-row and complex headers, so extracted data stays structured.

Page selection & control

Page selection & control

Process only the pages you need and avoid unnecessary extraction noise from the rest of the document.

Multiple exports

Multiple exports

Get XLSX, CSV, and JSON outputs from one run and choose the best format per downstream consumer.

Job-based tracking

Job-based tracking

Every extraction is tracked with status and metadata, making reruns and audits reproducible.

API-first automation

API-first automation

Integrate extraction directly into ETL and internal workflows through simple, predictable endpoints.

Optional OCR for scanned documents

Optional OCR for scanned documents

Enable OCR-assisted extraction when source PDFs are scanned and text layers are unavailable.

API for automation

This is all you need: Upload a file, receive a job_id, poll status, and download extracted tables in the format you need.

  • POST /v1/upload
  • GET /v1/extraction/{job_id}
  • GET /v1/download/{table_id}?format=xlsx|csv|json
View API documentation
API cURL example

How it works

Drop a file!

Drag a PDF document and drop it in the selected area. The upload starts immediately.

Workflow: Upload files

Select pages

Select those pages you wish to extract tables from. Just like you may know from apps as Microsoft Word (1,2,5-7,11,...)

Workflow: Upload files

Start the job

Just one click to start the job. You don't have to wait here. The job runs in background.

Workflow: Start Job

View results

After the job is finished, you can access the result data to preview, select and download the tables.

Workflow: View results

Table preview

View the table preview before downloading for more efficient work.

Workflow: View and download tables

Mass download all data

Or you can select all tables you whish and download a zip containing all files.

Workflow: Mass download all data

Who is pdftables.io for?

Tailored solutions for yout industry needs

Finance & Controlling

Finance & Controlling

Convert reporting tables from PDFs into clean datasets for monthly close and analysis.

Accounting / Tax / Audit

Accounting / Tax / Audit

Extract document-based tables faster while keeping records consistent for checks and reviews.

Procurement & Operations

Procurement & Operations

Capture supplier and operations data from PDF documents without manual copy/paste.

Data & BI teams

Data & BI teams

Feed standardized extraction outputs into dashboards, data models, and ETL jobs.

Developers / SaaS builders

Developers / SaaS builders

Embed table extraction with API endpoints to automate ingestion in your product stack.

Individuals & freelancers

Individuals & freelancers

Handle one-off client PDFs quickly and export directly into your preferred format.

Benefits at a glance

Designed for easy usage combined with powerful performance

  • checkmark
    Save time vs. manual copy/paste across recurring PDF workflows
  • checkmark
    Reduce extraction errors by standardizing outputs per table and job
  • checkmark
    Standardize downstream data for spreadsheets, BI tools, and APIs
  • checkmark
    Automate recurring workflows through predictable job and download endpoints
  • checkmark
    Scale from one-off files to batch-style operational processing

checkmark Security & privacy: Files processed securely. Data retention configurable.

Pricing

Start free. Upgrade when you need more.

Free

 
$0 forever
 

  • checkmark iconHigh quality extraction
  • checkmark iconOCR (for scanned documents)
  • checkmark iconData preview
  • checkmark iconXLS, CSV and JSON download
  • checkmark iconUp to 5 pages/month
  • checkmark iconUp to 10 MB per file
  • checkmark iconNo API access
  • checkmark iconNo encrypted files supported

Team

$89 / month
or
$999 / year

  • checkmark iconHigh quality extraction
  • checkmark iconOCR (for scanned documents)
  • checkmark iconData preview
  • checkmark iconXLS, CSV and JSON download
  • checkmark iconUp to 5000 pages/month
  • checkmark iconUp to 500 MB per file
  • checkmark iconAPI access enabled
  • checkmark iconEncrypted files supported

Need more pages? Feature request? Contact us.

FAQ

What does pdftables.io do?

pdftables.io extracts tables from PDF documents and converts them into structured data formats like Excel, CSV, and JSON, eliminating manual copy-paste work.

What types of PDFs are supported?

Both digital PDFs and scanned documents are supported. For scanned files, OCR is automatically used to detect and extract table content.

How accurate is the table detection?

The system is optimized for complex layouts, including multi-row headers, merged cells, and irregular table structures, ensuring highly structured outputs.

Can I choose which pages to process?

Yes. You can specify exact page ranges to extract tables only from relevant sections of a document.

Which export formats are available?

Extracted data can be downloaded as XLSX, CSV, or JSON, making it easy to integrate into different workflows.

Is there an API available?

Yes. pdftables.io provides a REST API so you can automate extraction and integrate it directly into your applications or data pipelines.

How long does extraction take?

Most jobs complete within seconds, depending on file size, number of pages, and table complexity.

Is my data secure?

Files are processed securely and stored only as long as needed for extraction and download, following standard data protection practices.

Do I need to install anything?

No installation is required. pdftables.io runs entirely in the browser as a cloud-based SaaS.

Can I process large volumes of PDFs?

Yes. The platform is designed to handle both single documents and large-scale batch processing through the API.

Turn PDF tables into clean data — now.