Apache Parquet

Documentation Releases

Apache Parquet is an open source, column-oriented data file format designed for efficient data storage and retrieval. It provides high performance compression and encoding schemes to handle complex data in bulk and is supported in many programming languages and analytics tools.

Documentation

Browse project documentation including the format specification.

Read more

Contributions welcome!

We do a Pull Request contributions workflow on GitHub. New users are always welcome!

Read more

Follow us on Twitter!

For announcements of latest features etc.

Read more