Monitoring Tools

Alternatives to collectd

StatsD, Nagios, Ganglia, Fluentd, and Prometheus are the most popular alternatives and competitors to collectd.

Stacks98

Followers156

+ 1

Votes5

What is collectd and what are its top alternatives?

collectd gathers statistics about the system it is running on and stores this information. Those statistics can then be used to find current performance bottlenecks (i.e. performance analysis) and predict future system load (i.e. capacity planning). Or if you just want pretty graphs of your private server and are fed up with some homegrown solution you're at the right place, too.

collectd is a tool in the Monitoring Tools category of a tech stack.

collectd is an open source tool with 3.2K GitHub stars and 1.2K GitHub forks. Here’s a link to collectd's open source repository on GitHub

Top Alternatives to collectd

StatsD
It is a network daemon that runs on the Node.js platform and listens for statistics, like counters and timers, sent over UDP or TCP and sends aggregates to one or more pluggable backend services (e.g., Graphite). ...
Nagios
Nagios is a host/service/network monitoring program written in C and released under the GNU General Public License. ...
Ganglia
It is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids. It is based on a hierarchical design targeted at federations of clusters. ...
Fluentd
Fluentd collects events from various data sources and writes them to files, RDBMS, NoSQL, IaaS, SaaS, Hadoop and so on. Fluentd helps you unify your logging infrastructure. ...
Prometheus
Prometheus is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true. ...
Munin
Munin is a networked resource monitoring tool that can help analyze resource trends and "what just happened to kill our performance?" problems. It is designed to be very plug and play. A default installation provides a lot of graphs with almost no work. ...
Telegraf
It is an agent for collecting, processing, aggregating, and writing metrics. Design goals are to have a minimal memory footprint with a plugin system so that developers in the community can easily add support for collecting metrics. ...
Metricbeat
Collect metrics from your systems and services. From CPU to memory, Redis to NGINX, and much more, It is a lightweight way to send system and service statistics. ...

collectd alternatives & related posts

StatsD

305

31

Simple daemon for easy stats aggregation

Stacks305

Votes31

PROS OF STATSD

9
Open source
7
Single responsibility
5
Efficient wire format
3
Handles aggregation
3
Loads of integrations
1
Many implementations
1
Scales well
1
Simple to use
1
NodeJS

CONS OF STATSD

1
No authentication; cannot be used over Internet

COMPARE

Compare StatsD vs collectd

related StatsD posts

Łukasz Korecki

CTO & Co-founder at EnjoyHQ · Apr 25, 2019 | 7 upvotes · 309.4K views

Shared insights

on

collectd

Google Compute Engine

Google Compute Engine

StatsD

Clojure

Stackdriver

at

We use collectd because of it's low footprint and great capabilities. We use it to monitor our Google Compute Engine machines. More interestingly we setup collectd as StatsD replacement - all our Clojure services push application-level metrics using our own metrics library and collectd pushes them to Stackdriver

Trey Tacon

Jul 1, 2019 | 6 upvotes · 936.9K views

Shared insights

on

Sentry

StatsD

Graphite

Grafana

PagerDuty

PagerDuty +1 more

at

A huge part of our continuous deployment practices is to have granular alerting and monitoring across the platform. To do this, we run Sentry on-premise, inside our VPCs, for our event alerting, and we run an awesome observability and monitoring system consisting of StatsD, Graphite and Grafana. We have dashboards using this system to monitor our core subsystems so that we can know the health of any given subsystem at any moment. This system ties into our PagerDuty rotation, as well as alerts from some of our Amazon CloudWatch alarms (we’re looking to migrate all of these to our internal monitoring system soon).

How Mixmax Uses Node and Go to Process 250M Events a day - Mixmax Tech Stack

Nagios

828

102

Complete monitoring and alerting for servers, switches, applications, and services

Stacks828

Votes102

PROS OF NAGIOS

53
It just works
28
The standard
12
Customizable
8
The Most flexible monitoring system
1
Huge stack of free checks/plugins to choose from

CONS OF NAGIOS

Be the first to leave a con

COMPARE

Compare Nagios vs collectd

related Nagios posts

Conor Myhrvold

Tech Brand Mgr, Office of CTO at Uber · Dec 4, 2018 | 15 upvotes · 5.2M views

Shared insights

on

Prometheus

Graphite

Grafana

Nagios

at

Uber Technologies

Why we spent several years building an open source, large-scale metrics alerting system, M3, built for Prometheus:

By late 2014, all services, infrastructure, and servers at Uber emitted metrics to a Graphite stack that stored them using the Whisper file format in a sharded Carbon cluster. We used Grafana for dashboarding and Nagios for alerting, issuing Graphite threshold checks via source-controlled scripts. While this worked for a while, expanding the Carbon cluster required a manual resharding process and, due to lack of replication, any single node’s disk failure caused permanent loss of its associated metrics. In short, this solution was not able to meet our needs as the company continued to grow.

To ensure the scalability of Uber’s metrics backend, we decided to build out a system that provided fault tolerant metrics ingestion, storage, and querying as a managed platform...

https://eng.uber.com/m3/

(GitHub : https://github.com/m3db/m3)

M3: Uber’s Open Source, Large-scale Metrics Platform for Prometheus

Rashid Ansari

Jan 25, 2022 | 3 upvotes · 243.3K views

Shared insights

on

Prometheus

Nagios

I am new to DevOps and looking for training in DevOps. Some institutes are offering Nagios while some Prometheus in their syllabus. Please suggest which one is being used in the industry and which one should I learn.

Ganglia

28

0

Scalable distributed monitoring system

Stacks28

Votes0

PROS OF GANGLIA

Be the first to leave a pro

CONS OF GANGLIA

Be the first to leave a con

COMPARE

Compare Ganglia vs collectd

related Ganglia posts

Fluentd

603

38

Unified logging layer

Stacks603

Votes38

PROS OF FLUENTD

11
Open-source
9
Easy
9
Great for Kubernetes node container log forwarding
9
Lightweight

CONS OF FLUENTD

Be the first to leave a con

COMPARE

Compare Fluentd vs collectd

related Fluentd posts

Prometheus

4.3K

239

An open-source service monitoring system and time series database, developed by SoundCloud

Stacks4.3K

Votes239

PROS OF PROMETHEUS

47
Powerful easy to use monitoring
38
Flexible query language
32
Dimensional data model
27
Alerts
23
Active and responsive community
22
Extensive integrations
19
Easy to setup
12
Beautiful Model and Query language
7
Easy to extend
6
Nice
3
Written in Go
2
Good for experimentation
1
Easy for monitoring

CONS OF PROMETHEUS

12
Just for metrics
6
Bad UI
6
Needs monitoring to access metrics endpoints
4
Not easy to configure and use
3
Supports only active agents
2
Written in Go
2
TLS is quite difficult to understand
2
Requires multiple applications and tools
1
Single point of failure

COMPARE

Compare Prometheus vs collectd

related Prometheus posts

Matt Menzenski

Senior Software Engineering Manager at PayIt · May 3, 2021 | 16 upvotes · 1.1M views

Shared insights

on

prom-client

Node.js

Dropwizard

Java

Kubernetes

Kubernetes +2 more

at

Grafana and Prometheus together, running on Kubernetes , is a powerful combination. These tools are cloud-native and offer a large community and easy integrations. At PayIt we're using exporting Java application metrics using a Dropwizard metrics exporter, and our Node.js services now use the prom-client npm library to serve metrics.

With multiplying microservices running on Kubernetes, PayIt turned to Grafana and Prometheus for observability at cloud native scale | Grafana Labs

Conor Myhrvold

Tech Brand Mgr, Office of CTO at Uber · Dec 4, 2018 | 15 upvotes · 5.2M views

Shared insights

on

Prometheus

Graphite

Grafana

Nagios

at

Uber Technologies

Why we spent several years building an open source, large-scale metrics alerting system, M3, built for Prometheus:

By late 2014, all services, infrastructure, and servers at Uber emitted metrics to a Graphite stack that stored them using the Whisper file format in a sharded Carbon cluster. We used Grafana for dashboarding and Nagios for alerting, issuing Graphite threshold checks via source-controlled scripts. While this worked for a while, expanding the Carbon cluster required a manual resharding process and, due to lack of replication, any single node’s disk failure caused permanent loss of its associated metrics. In short, this solution was not able to meet our needs as the company continued to grow.

To ensure the scalability of Uber’s metrics backend, we decided to build out a system that provided fault tolerant metrics ingestion, storage, and querying as a managed platform...

https://eng.uber.com/m3/

(GitHub : https://github.com/m3db/m3)

M3: Uber’s Open Source, Large-scale Metrics Platform for Prometheus

Munin

72

10

PnP networked resource monitoring tool that can help to answer the what just happened to kill our performance

Stacks72

Votes10

PROS OF MUNIN

3
Good defaults
2
Extremely fast to install
2
Alerts can trigger any command line program
2
Adheres to traditional Linux standards
1
Easy to write custom plugins

CONS OF MUNIN

Be the first to leave a con

COMPARE

Compare Munin vs collectd

related Munin posts

Telegraf

283

16

The plugin-driven server agent for collecting & reporting metrics

Stacks283

Votes16

PROS OF TELEGRAF

5
One agent can work as multiple exporter with min hndlng
5
Cohesioned stack for monitoring
2
Open Source
2
Metrics
1
Supports custom plugins in any language
1
Many hundreds of plugins

CONS OF TELEGRAF

Be the first to leave a con

COMPARE

Compare Telegraf vs collectd

related Telegraf posts

Metricbeat

49

3

A Lightweight Shipper for Metrics

Stacks49

Votes3

PROS OF METRICBEAT

2
Simple
1
Easy to setup

CONS OF METRICBEAT

Be the first to leave a con

COMPARE

Compare Metricbeat vs collectd

related Metricbeat posts

Sunil Chaudhari

Team Lead at XYZ · Jun 15, 2020 | 2 upvotes · 579.3K views

Shared insights

on

Elasticsearch

Metricbeat

Linux

Grafana

Prometheus

Hi, We have a situation, where we are using Prometheus to get system metrics from PCF (Pivotal Cloud Foundry) platform. We send that as time-series data to Cortex via a Prometheus server and built a dashboard using Grafana. There is another pipeline where we need to read metrics from a Linux server using Metricbeat, CPU, memory, and Disk. That will be sent to Elasticsearch and Grafana will pull and show the data in a dashboard.

Is it OK to use Metricbeat for Linux server or can we use Prometheus?

What is the difference in system metrics sent by Metricbeat and Prometheus node exporters?

Regards, Sunil.