PDF to JSON Converter

Extract tables and data from PDF files and save as JSON. Choose output format: array of arrays or array of objects. 100% private.

1. Upload PDF

Drag & drop or click to browse

Max 50MB

2. JSON Settings

File Status

No PDF selected

Table Extraction

Detects rows/columns based on text positions, preserves structure.

Flexible JSON

Choose array of arrays or objects. Pretty print option.

100% Private

Files never leave your device. No servers.

Unlimited

No restrictions, free forever.

What is a PDF to JSON Converter?

A PDF to JSON converter extracts tabular data from PDF files and saves it as JSON (JavaScript Object Notation). JSON is a lightweight, language-independent format ideal for APIs, databases, and structured data processing. This tool helps you turn bank statements, reports, invoices, or any structured PDF into reusable JSON data.


How to Convert PDF to JSON (Step‑by‑Step)

  1. Upload your PDF – Drag and drop or click to select a PDF (max 50MB).
  2. Choose JSON format – Select “Array of arrays” for raw rows, or “Array of objects” to use the first row as property names.
  3. Click “Convert to JSON” – The tool extracts text, groups it into rows based on vertical position, and sorts columns horizontally.
  4. Preview and download – A preview of the JSON data appears. Click “Download JSON” to save the file.

How the extraction works

The tool uses PDF.js to read every text element and its coordinates (x, y). It then groups items that are close together vertically (forming rows) and orders them left‑to‑right to create columns. This works well for simple tables with consistent line spacing. For complex layouts, you may need to adjust the output manually.


Frequently Asked Questions

Can I extract tables from scanned PDFs?

No, this tool works only with text‑based PDFs. Scanned documents require OCR (Optical Character Recognition) – we plan to add that in the future.

What's the difference between the two JSON formats?

Array of arrays returns each row as an array, e.g. [["Name","Age"],["John",30]]. Array of objects uses the first row as keys, returning [{"Name":"John","Age":30}]. Choose the one that fits your application.

Will the formatting be perfect?

It depends on the PDF structure. Simple, well‑structured tables usually convert accurately. Multi‑column layouts or merged cells may not translate perfectly. You can always edit the resulting JSON.

Is my data safe?

Absolutely. All processing happens in your browser – no file is uploaded to any server.


Last updated: March 2026