# tabula

> Extract tables from PDF files.
> More information: <https://tabula.technology>.

- Extract all tables from a PDF to a CSV file:

`tabula -o {{file.csv}} {{file.pdf}}`

- Extract all tables from a PDF to a JSON file:

`tabula --format JSON -o {{file.json}} {{file.pdf}}`

- Extract tables from pages 1, 2, 3, and 6 of a PDF:

`tabula --pages {{1-3,6}} {{file.pdf}}`

- Extract tables from page 1 of a PDF, guessing which portion of the page to examine:

`tabula --guess --pages {{1}} {{file.pdf}}`

- Extract all tables from a PDF, using ruling lines to determine cell boundaries:

`tabula --spreadsheet {{file.pdf}}`

- Extract all tables from a PDF, using blank space to determine cell boundaries:

`tabula --no-spreadsheet {{file.pdf}}`