tldr/pages/common/parquet-tools.md
parquet-tools

Show, inspect, and manipulate Parquet files. More information: https://github.com/apache/parquet-mr.

  • Display the content of a Parquet file:

parquet-tools cat {{path/to/parquet}}

  • Display the first few records of a Parquet file:

parquet-tools head {{path/to/parquet}}

  • Print the schema of a Parquet file:

parquet-tools schema {{path/to/parquet}}

  • Print the metadata of a Parquet file:

parquet-tools meta {{path/to/parquet}}

  • Print the content and metadata of a Parquet file:

parquet-tools dump {{path/to/parquet}}

  • Merge several Parquet files into a single target file:

parquet-tools merge {{path/to/parquet1}} {{path/to/parquet2}} {{path/to/target_parquet}}

  • Print the number of rows in a Parquet file:

parquet-tools rowcount {{path/to/parquet}}

  • Print the column and offset indexes of a Parquet file:

parquet-tools column-index {{path/to/parquet}}
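
The `merge` and `rowcount` commands above can be combined into a quick sanity check that a merged file holds as many rows as its inputs. The following is a minimal sketch, not part of the tool itself: `total_rows` is a hypothetical helper, the file names in the usage comment are placeholders, and it assumes `rowcount` prints a single line ending in the count (for example `Total RowCount: 42`), which may differ between builds.

```shell
#!/bin/sh
# Sketch: sum the row counts reported for several Parquet files.
# Assumption: `parquet-tools rowcount` prints a line whose last field is
# the count, e.g. "Total RowCount: 42". Adjust the awk expression otherwise.
total_rows() {
  total=0
  for file in "$@"; do
    # Take the last field of the last non-empty output line as the count.
    count=$(parquet-tools rowcount "$file" | awk 'NF { n = $NF } END { print n + 0 }')
    total=$((total + count))
  done
  echo "$total"
}

# Hypothetical usage: compare the inputs against the merged output.
# [ "$(total_rows part1.parquet part2.parquet)" -eq "$(total_rows merged.parquet)" ] \
#   && echo "row counts match"
```

A file whose count cannot be parsed contributes 0 to the total, so a mismatch is worth checking by hand with `parquet-tools rowcount` on each file.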