StackShareStackShare
Follow on
StackShare

Discover and share technology stacks from companies around the world.

Follow on

© 2025 StackShare. All rights reserved.

Product

  • Stacks
  • Tools
  • Feed

Company

  • About
  • Contact

Legal

  • Privacy Policy
  • Terms of Service
Apache Parquet
ByApache-parquetApache-parquet

Apache Parquet

#107in Databases
Discussions1
Followers190
OverviewDiscussions1AdoptionAlternativesIntegrations
Try It

What is Apache Parquet?

It is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language.

Apache Parquet is a tool in the Databases category of a tech stack.

Key Features

Columnar storage formatType-specific encodingPig integrationCascading integrationCrunch integrationApache Arrow integrationApache Scrooge integrationAdaptive dictionary encodingPredicate pushdownColumn stats

Apache Parquet Pros & Cons

Pros of Apache Parquet

No pros listed yet.

Cons of Apache Parquet

No cons listed yet.

Apache Parquet Alternatives & Comparisons

What are some alternatives to Apache Parquet?

MySQL

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

MongoDB

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

Microsoft SQL Server

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

MariaDB

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

Try It

Visit Website

Adoption

On StackShare

Apache Parquet Integrations

Hadoop, Java, Apache Impala, Apache Thrift, Apache Hive and 6 more are some of the popular tools that integrate with Apache Parquet. Here's a list of all 11 tools that integrate with Apache Parquet.

Hadoop
Hadoop
Java
Java
Apache Impala
Apache Impala
Apache Thrift
Apache Thrift
Apache Hive
Apache Hive
Pig
Pig
AWS Data Wrangler
AWS Data Wrangler
DSQ
DSQ
ArcticDB
ArcticDB
Hackolade
Hackolade
StarRocks
StarRocks

Apache Parquet Discussions

Discover why developers choose Apache Parquet. Read real-world technical decisions and stack choices from the StackShare community.

Pardha Saradhi
Pardha Saradhi

Technical Lead

Dec 10, 2020

Needs adviceonAmazon S3Amazon S3Apache ParquetApache ParquetPrestoPresto

Hi,

We are currently storing the data in Amazon S3 using Apache Parquet format. We are using Presto to query the data from S3 and catalog it using AWS Glue catalog. We have Metabase sitting on top of Presto, where our reports are present. Currently, Presto is becoming too costly for us, and we are looking for alternatives for it but want to use the remaining setup (S3, Metabase) as much as possible. Please suggest alternative approaches.

0 views0
Comments
Companies
26
PSPTAG+20
Developers
68
IMFAMJ+62