Datadog vs Grafana

Datadog
Datadog

1.9K
1.3K
694
Grafana
Grafana

2.6K
1.7K
303
Add tool

Datadog vs Grafana: What are the differences?

Developers describe Datadog as "Unify logs, metrics, and traces from across your distributed infrastructure". Datadog is the leading service for cloud-scale monitoring. It is used by IT, operations, and development teams who build and operate applications that run on dynamic or hybrid cloud infrastructure. Start monitoring in minutes with Datadog!. On the other hand, Grafana is detailed as "Open source Graphite & InfluxDB Dashboard and Graph Editor". Grafana is a general purpose dashboard and graph composer. It's focused on providing rich ways to visualize time series metrics, mainly though graphs but supports other ways to visualize data through a pluggable panel architecture. It currently has rich support for for Graphite, InfluxDB and OpenTSDB. But supports other data sources via plugins.

Datadog can be classified as a tool in the "Performance Monitoring" category, while Grafana is grouped under "Monitoring Tools".

Some of the features offered by Datadog are:

  • 14-day Free Trial for an unlimited number of hosts
  • 200+ turn-key integrations for data aggregation
  • Clean graphs of StatsD and other integrations

On the other hand, Grafana provides the following key features:

  • Create, edit, save & search dashboards
  • Change column spans and row heights
  • Drag and drop panels to rearrange

"Monitoring for many apps (databases, web servers, etc)" is the primary reason why developers consider Datadog over the competitors, whereas "Beautiful" was stated as the key factor in picking Grafana.

Grafana is an open source tool with 29.3K GitHub stars and 5.55K GitHub forks. Here's a link to Grafana's open source repository on GitHub.

According to the StackShare community, Grafana has a broader approval, being mentioned in 558 company stacks & 313 developers stacks; compared to Datadog, which is listed in 532 company stacks and 213 developer stacks.

- No public GitHub repository available -

What is Datadog?

Datadog is the leading service for cloud-scale monitoring. It is used by IT, operations, and development teams who build and operate applications that run on dynamic or hybrid cloud infrastructure. Start monitoring in minutes with Datadog!

What is Grafana?

Grafana is a general purpose dashboard and graph composer. It's focused on providing rich ways to visualize time series metrics, mainly though graphs but supports other ways to visualize data through a pluggable panel architecture. It currently has rich support for for Graphite, InfluxDB and OpenTSDB. But supports other data sources via plugins.

Want advice about which of these to choose?Ask the StackShare community!

Why do developers choose Datadog?
Why do developers choose Grafana?

Sign up to add, upvote and see more prosMake informed product decisions

What are the cons of using Datadog?
What are the cons of using Grafana?
    Be the first to leave a con
    What companies use Datadog?
    What companies use Grafana?

    Sign up to get full access to all the companiesMake informed product decisions

    What tools integrate with Datadog?
    What tools integrate with Grafana?

    Sign up to get full access to all the tool integrationsMake informed product decisions

    What are some alternatives to Datadog and Grafana?
    New Relic
    New Relic is the all-in-one web application performance tool that lets you see performance from the end user experience, through servers, and down to the line of application code.
    Splunk
    Splunk Inc. provides the leading platform for Operational Intelligence. Customers use Splunk to search, monitor, analyze and visualize machine data.
    Prometheus
    Prometheus is a systems and service monitoring system. It collects metrics from configured targets at given intervals, evaluates rule expressions, displays the results, and can trigger alerts if some condition is observed to be true.
    AppDynamics
    AppDynamics develops application performance management (APM) solutions that deliver problem resolution for highly distributed applications through transaction flow monitoring and deep diagnostics.
    Sentry
    Sentry is an open-source platform for workflow productivity, aggregating errors from across the stack in real time. 500K developers use Sentry to get the code-level context they need to resolve issues at every stage of the app lifecycle.
    See all alternatives
    Decisions about Datadog and Grafana
    StackShare Editors
    StackShare Editors
    Flask
    AWS EC2
    Celery
    Datadog
    PagerDuty
    Airflow
    StatsD
    Grafana

    Data science and engineering teams at Lyft maintain several big data pipelines that serve as the foundation for various types of analysis throughout the business.

    Apache Airflow sits at the center of this big data infrastructure, allowing users to “programmatically author, schedule, and monitor data pipelines.” Airflow is an open source tool, and “Lyft is the very first Airflow adopter in production since the project was open sourced around three years ago.”

    There are several key components of the architecture. A web UI allows users to view the status of their queries, along with an audit trail of any modifications the query. A metadata database stores things like job status and task instance status. A multi-process scheduler handles job requests, and triggers the executor to execute those tasks.

    Airflow supports several executors, though Lyft uses CeleryExecutor to scale task execution in production. Airflow is deployed to three Amazon Auto Scaling Groups, with each associated with a celery queue.

    Audit logs supplied to the web UI are powered by the existing Airflow audit logs as well as Flask signal.

    Datadog, Statsd, Grafana, and PagerDuty are all used to monitor the Airflow system.

    See more
    Interest over time
    Reviews of Datadog and Grafana
    Review ofDatadogDatadog

    Background

    We're a real-time financial services messaging company, so being able to monitor our servers and applications in real-time is important to us. We also like a good deal, so $15/server seemed a bargain.

    What were we looking for?

    We wanted to monitor our MS infrastructure (servers, SQL) and apps (C#) to understand performance issues and be able to rectify. We also want to be able to do long-term trending. And we wanted to go from nothing to live in a short time.

    Experience

    Installing the Datadog agent on the servers was a breeze and enabling the integrations for SQL and Windows trivial.

    Using the StatsD based API was also very easy - no worrying about JSON or UDP calls. The ability to add tags to all metrics is also a key benefit. We run multiple (100+) instances of a single application and being able to distinguish events from each one via tagging, or to see aggregates, is extremely useful.

    In all it took 2 days R&D to instrument our key applications sufficiently for production deployment. Deploying the agent to our production servers took 30 mins, giving our Ops team complete visibility for the 1st time.

    What have we learned

    Since we've been live Datadog has given us numerous insights into the way our system behaves, from uneven server loadings and sporadic memory usage to performance tuning a key application that resulted in a 50% increase in throughput. Knowing what's taking the time has been a boon.

    Continuous evolution

    The other nice surprise has been the evolving nature of Datadog. It seems like every couple of weeks there's a new feature on the site.

    Other points

    • I like the transparent pricing. Services that won't show me the price without having to talk to a sales person are really annoying.
    • Support has been good. We've contacted them several times with questions and always had a quick response (time zone considered...we're in London) and a helpful answer.

    So What's bad?

    Probably the weakest aspect at the moment is the long term trending of data. Whilst you can wind the time bar back to see what happened last week you can't ask questions like "show me the peak period each day for the last x months". The "get data" API is also fairly weak. Neither are concerns at the moment, and I'm sure they're on the to-do list.

    Review ofDatadogDatadog

    I've been a systems administrator most of my career. Everywhere I went, I'd have to rebuild the same monitoring + graphing system. And then make sure that every machine wrote to that system and every application handed up the proper metrics through whatever mechanism seemed good at the time.

    Then, as CTO of SimpleReach, single-handedly managing over 200 servers in addition to everything else, I found Datadog. We were already using statsd to instrument our applications, now it was just a matter of getting that data to Datadog. We use Chef, so I installed the Datadog agent on every machine in about 10 minutes and we were up and running.

    The best part was that we had a deploy problem the next day with one of our main applications and troubleshooting took minutes instead of hours (and Datadog immediately paid for itself). Now no new features go out without instrumentation and no machine gets created without being monitored.

    Datadog just scales with us. Great service and I highly recommend it to anyone not looking to reinvent the wheel with monitoring and instrumentation.

    Review ofDatadogDatadog

    Datadog makes running a service with 800,000 unique users a month possible as a single developer/maintainer. I bought a separate monitor just to keep my datadog dashboards always visible and rely on triggers to keep watch over 20+ servers.

    Highly recommended.

    Review ofDatadogDatadog

    We use datadog to monitor our servers and some application metrics. Easy to get started and scale to many servers. Datadog support engineers are always quick to respond to bugs and other challenges.

    Review ofGrafanaGrafana

    analyze heap dump and many logging or traces

    How developers use Datadog and Grafana
    Avatar of DigitalPermits
    DigitalPermits uses DatadogDatadog

    We just started looking into Datadog, but from what we see, it's like New Relic meets Loggly. It's really easy to plugin different services (like the one on this list) and get detailed analysis of what is happening on your servers and services. It makes tracking down sparse and difficult to understand problems possible.

    Avatar of ShadowICT
    ShadowICT uses GrafanaGrafana

    We use Grafana to view live stats relating to our servers such as memory and CPU usage. We also use Grafana to monitor our gaming servers for data such as latency and player counts. This allows us to generate effective analytics and see when problems arise.

    Avatar of Andrew Gatenby
    Andrew Gatenby uses GrafanaGrafana

    Everyone likes graphs, right?! This isn't a tool we actively use right now, but paired with Prometheus we want to use it to have visual monitors on things like API cluster health, status, queue stats, DB/redis query and cache stats etc.

    Avatar of Scrayos UG (haftungsbeschränkt)
    Scrayos UG (haftungsbeschränkt) uses GrafanaGrafana

    Grafana is used in combination with Prometheus to display the gathered stats and to monitor our physical servers aswell as their virtual applications. We also use Grafana to get notifications about irregularities.

    Avatar of Sail Tactics
    Sail Tactics uses DatadogDatadog

    Monitoring day-to-day operations of multiple high-performance computing assets distributed across several networks. Monitoring vendor provided data and setting up alerts when things do not show up on time.

    Avatar of AngeloR
    AngeloR uses DatadogDatadog

    Datadog was used as an agent for monitoring and as for the statsd daemon included. This way we are able to have automated system stats and include whatever other metrics we want to track.

    Avatar of sapslaj
    sapslaj uses GrafanaGrafana

    Grafana takes the data from InfluxDB and presents it in a nice flexible format. Bonus points for built-in alerts and playlists (cycles through different dashboards automatically)

    Avatar of Bùi Thanh
    Bùi Thanh uses GrafanaGrafana
    • Graph report with many panels and Dashboard.
    • Easy to deploy, and view performance of system.
    • Intergrating with many datasource: Prometheus, CloudWatch
    • Alerts
    Avatar of Dynamictivity
    Dynamictivity uses DatadogDatadog

    Datadog is used because it has a great free tier and it provides us with great insights and integrations into our infrastructure and tools.

    Avatar of Flux Work
    Flux Work uses DatadogDatadog

    Powerful all-in-one monitoring solution as a service. Good integration with AWS. Very affordable price for small-scale startups.

    How much does Datadog cost?
    How much does Grafana cost?
    Pricing unavailable