Prometheus vs Thanos

Overview

Prometheus

Stacks4.8K

Followers3.8K

Votes239

GitHub Stars61.1K

Forks9.9K

Thanos

Stacks101

Followers126

Votes0

Prometheus vs Thanos: What are the differences?

Prometheus and Thanos are two popular tools used for monitoring and observability in modern infrastructure and cloud-native applications. Let's explore the key differences between Prometheus and Thanos:

Data Storage: Prometheus is designed to store metrics locally on each individual node or server. It uses a time-series database for efficient storage and retrieval of metrics data. On the other hand, Thanos extends Prometheus by providing a robust distributed storage layer. It allows metrics to be stored in a highly available and scalable manner across multiple clustered instances, enabling long-term retention and global query views.
Data Retention: Prometheus retains metrics data on a short-term basis, typically up to a few weeks. While it offers local data storage and efficient querying, it lacks the capability for long-term retention and global querying of historical metrics data. Thanos, on the other hand, provides long-term data retention by leveraging its distributed storage layer, allowing organizations to retain metrics data for months or even years. This enables deep insights and analysis of historical trends and patterns.
High Availability: Prometheus is designed to be deployed as a single instance, making it susceptible to single points of failure. In case of a failure, metrics data can be lost or become inaccessible. Thanos addresses this issue by providing a highly available architecture. It allows data replication and redundancy across multiple instances, ensuring continuous availability even if individual nodes fail. This makes Thanos well-suited for mission-critical monitoring and observability requirements.
Federation and Global Queries: Prometheus supports a federation feature that allows multiple Prometheus instances to be centrally queried. However, federated queries in Prometheus can be resource-intensive and result in increased latency. Thanos provides a more efficient solution by introducing global queries. Thanos combines data from multiple Prometheus instances transparently, providing a unified view for querying across all the data without the need for expensive federation operations.
Data Deduplication: In Prometheus, if multiple instances scrape the same target, duplicate metrics can occur. Although Prometheus de-duplicates metrics during querying, the duplicate data is still stored, leading to increased storage requirements. Thanos tackles this issue by performing data deduplication during the compaction process, reducing storage costs by eliminating redundant metrics and storing only unique data.
Horizontal Scalability: Prometheus is designed to run as a vertical scaling solution, where a single instance can handle a certain volume of metrics data. As the data grows, additional instances need to be deployed in a sharded setup. Thanos, on the other hand, provides horizontal scalability out of the box. It allows for seamless scaling by adding more instances to the cluster, thus distributing the load and handling increased metrics ingestion and query traffic.

In summary, Prometheus excels in local data storage and efficient querying, while Thanos extends Prometheus by providing distributed storage, long-term retention, high availability, global querying, data deduplication, and horizontal scalability. Thanos is a powerful tool for organizations requiring scalable, fault-tolerant, and long-term storage and analysis of massive metrics data.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Prometheus, Thanos

Leonardo

Pleno QA Enginneer at SolarMarket

Dec 8, 2020

Decided

The objective of this work was to develop a system to monitor the materials of a production line using IoT technology. Currently, the process of monitoring and replacing parts depends on manual services. For this, load cells, microcontroller, Broker MQTT, Telegraf, InfluxDB, and Grafana were used. It was implemented in a workflow that had the function of collecting sensor data, storing it in a database, and visualizing it in the form of weight and quantity. With these developed solutions, he hopes to contribute to the logistics area, in the replacement and control of materials.

403k views403k

Comments

Raja Subramaniam

Aug 27, 2019

Needs adviceon

Prometheus

Kubernetes

Sysdig

We have Prometheus as a monitoring engine as a part of our stack which contains Kubernetes cluster, container images and other open source tools. Also, I am aware that Sysdig can be integrated with Prometheus but I really wanted to know whether Sysdig or sysdig+prometheus will make better monitoring solution.

779k views779k

Comments

Susmita

Senior SRE at African Bank

Jul 28, 2020

Needs adviceon

Grafana

Looking for a tool which can be used for mainly dashboard purposes, but here are the main requirements:

Must be able to get custom data from AS400,
Able to display automation test results,
System monitoring / Nginx API,
Able to get data from 3rd parties DB.

Grafana is almost solving all the problems, except AS400 and no database to get automation test results.

869k views869k

Comments

Detailed Comparison

Prometheus	Thanos
Prometheus is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true.	Thanos is a set of components that can be composed into a highly available metric system with unlimited storage capacity. It can be added seamlessly on top of existing Prometheus deployments and leverages the Prometheus 2.0 storage format to cost-efficiently store historical metric data in any object storage while retaining fast query latencies. Additionally, it provides a global query view across all Prometheus installations and can merge data from Prometheus HA pairs on the fly.
Dimensional data; Powerful queries; Great visualization; Efficient storage; Precise alerting; Simple operation	Global querying view across all connected Prometheus servers; Deduplication and merging of metrics collected from Prometheus HA pairs; Seamless integration with existing Prometheus setups; Any object storage as its only, optional dependency; Downsampling historical data for massive query speedup; Cross-cluster federation; Fault-tolerant query routing; Simple gRPC "Store API" for unified data access across all metric data; Easy integration points for custom metric providers
Statistics
GitHub Stars 61.1K	GitHub Stars -
GitHub Forks 9.9K	GitHub Forks -
Stacks 4.8K	Stacks 101
Followers 3.8K	Followers 126
Votes 239	Votes 0
Pros & Cons
Pros 47 Powerful easy to use monitoring 38 Flexible query language 32 Dimensional data model 27 Alerts 23 Active and responsive community Cons 12 Just for metrics 6 Bad UI 6 Needs monitoring to access metrics endpoints 4 Not easy to configure and use 3 Supports only active agents	No community feedback yet
Integrations
Grafana	No integrations available

What are some alternatives to Prometheus, Thanos?

Grafana

Grafana is a general purpose dashboard and graph composer. It's focused on providing rich ways to visualize time series metrics, mainly though graphs but supports other ways to visualize data through a pluggable panel architecture. It currently has rich support for for Graphite, InfluxDB and OpenTSDB. But supports other data sources via plugins.

Kibana

Kibana is an open source (Apache Licensed), browser based analytics and search dashboard for Elasticsearch. Kibana is a snap to setup and start using. Kibana strives to be easy to get started with, while also being flexible and powerful, just like Elasticsearch.

Nagios

Nagios is a host/service/network monitoring program written in C and released under the GNU General Public License.

Netdata

Netdata collects metrics per second & presents them in low-latency dashboards. It's designed to run on all of your physical & virtual servers, cloud deployments, Kubernetes clusters & edge/IoT devices, to monitor systems, containers & apps

Zabbix

Zabbix is a mature and effortless enterprise-class open source monitoring solution for network monitoring and application monitoring of millions of metrics.

Sensu

Sensu is the future-proof solution for multi-cloud monitoring at scale. The Sensu monitoring event pipeline empowers businesses to automate their monitoring workflows and gain deep visibility into their multi-cloud environments.

Graphite

Graphite does two things: 1) Store numeric time-series data and 2) Render graphs of this data on demand

Lumigo

Lumigo is an observability platform built for developers, unifying distributed tracing with payload data, log management, and real-time metrics to help you deeply understand and troubleshoot your systems.

StatsD

It is a network daemon that runs on the Node.js platform and listens for statistics, like counters and timers, sent over UDP or TCP and sends aggregates to one or more pluggable backend services (e.g., Graphite).

Jaeger

Jaeger, a Distributed Tracing System

Related Comparisons

Bitbucket vs GitHub vs GitLab

AWS CodeCommit vs Bitbucket vs GitHub

Docker Swarm vs Kubernetes vs Rancher

Grunt vs Webpack vs gulp

Grafana vs Graphite vs Kibana

Overview

Prometheus

Stacks4.8K

Followers3.8K

Votes239

GitHub Stars61.1K

Forks9.9K

Thanos

Stacks101

Followers126

Votes0

Prometheus vs Thanos: What are the differences?

Data Storage: Prometheus is designed to store metrics locally on each individual node or server. It uses a time-series database for efficient storage and retrieval of metrics data. On the other hand, Thanos extends Prometheus by providing a robust distributed storage layer. It allows metrics to be stored in a highly available and scalable manner across multiple clustered instances, enabling long-term retention and global query views.
Data Retention: Prometheus retains metrics data on a short-term basis, typically up to a few weeks. While it offers local data storage and efficient querying, it lacks the capability for long-term retention and global querying of historical metrics data. Thanos, on the other hand, provides long-term data retention by leveraging its distributed storage layer, allowing organizations to retain metrics data for months or even years. This enables deep insights and analysis of historical trends and patterns.
High Availability: Prometheus is designed to be deployed as a single instance, making it susceptible to single points of failure. In case of a failure, metrics data can be lost or become inaccessible. Thanos addresses this issue by providing a highly available architecture. It allows data replication and redundancy across multiple instances, ensuring continuous availability even if individual nodes fail. This makes Thanos well-suited for mission-critical monitoring and observability requirements.
Federation and Global Queries: Prometheus supports a federation feature that allows multiple Prometheus instances to be centrally queried. However, federated queries in Prometheus can be resource-intensive and result in increased latency. Thanos provides a more efficient solution by introducing global queries. Thanos combines data from multiple Prometheus instances transparently, providing a unified view for querying across all the data without the need for expensive federation operations.
Data Deduplication: In Prometheus, if multiple instances scrape the same target, duplicate metrics can occur. Although Prometheus de-duplicates metrics during querying, the duplicate data is still stored, leading to increased storage requirements. Thanos tackles this issue by performing data deduplication during the compaction process, reducing storage costs by eliminating redundant metrics and storing only unique data.
Horizontal Scalability: Prometheus is designed to run as a vertical scaling solution, where a single instance can handle a certain volume of metrics data. As the data grows, additional instances need to be deployed in a sharded setup. Thanos, on the other hand, provides horizontal scalability out of the box. It allows for seamless scaling by adding more instances to the cluster, thus distributing the load and handling increased metrics ingestion and query traffic.

Advice on Prometheus, Thanos

Leonardo

Pleno QA Enginneer at SolarMarket

Dec 8, 2020

Decided

403k views403k

Comments

Raja Subramaniam

Aug 27, 2019

Needs adviceon

779k views779k

Senior SRE at African Bank

Jul 28, 2020

Needs adviceon

Grafana

Looking for a tool which can be used for mainly dashboard purposes, but here are the main requirements:

Must be able to get custom data from AS400,
Able to display automation test results,
System monitoring / Nginx API,
Able to get data from 3rd parties DB.

Grafana is almost solving all the problems, except AS400 and no database to get automation test results.

869k views869k

Comments

Detailed Comparison

Prometheus	Thanos
Prometheus is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true.	Thanos is a set of components that can be composed into a highly available metric system with unlimited storage capacity. It can be added seamlessly on top of existing Prometheus deployments and leverages the Prometheus 2.0 storage format to cost-efficiently store historical metric data in any object storage while retaining fast query latencies. Additionally, it provides a global query view across all Prometheus installations and can merge data from Prometheus HA pairs on the fly.
Dimensional data; Powerful queries; Great visualization; Efficient storage; Precise alerting; Simple operation	Global querying view across all connected Prometheus servers; Deduplication and merging of metrics collected from Prometheus HA pairs; Seamless integration with existing Prometheus setups; Any object storage as its only, optional dependency; Downsampling historical data for massive query speedup; Cross-cluster federation; Fault-tolerant query routing; Simple gRPC "Store API" for unified data access across all metric data; Easy integration points for custom metric providers
Statistics
GitHub Stars 61.1K	GitHub Stars -
GitHub Forks 9.9K	GitHub Forks -
Stacks 4.8K	Stacks 101
Followers 3.8K	Followers 126
Votes 239	Votes 0
Pros & Cons
Pros 47 Powerful easy to use monitoring 38 Flexible query language 32 Dimensional data model 27 Alerts 23 Active and responsive community Cons 12 Just for metrics 6 Bad UI 6 Needs monitoring to access metrics endpoints 4 Not easy to configure and use 3 Supports only active agents	No community feedback yet
Integrations
Grafana	No integrations available