PagerDuty is an alarm aggregation and dispatching service for system administrators and support teams. It collects alerts from your monitoring tools, gives you an overall view of all of your monitoring alarms, and alerts an on duty engineer if there's a problem. | Datadog is the leading service for cloud-scale monitoring. It is used by IT, operations, and development teams who build and operate applications that run on dynamic or hybrid cloud infrastructure. Start monitoring in minutes with Datadog! |
Alerting that works (and wakes you up)- When your systems go down, PagerDuty will wake you up. You choose how you want to be alerted - via phone, SMS or email, to multiple numbers, with retries.;Integrate all your existing monitoring tools- PagerDuty works great with almost all monitoring tools including: Nagios (and Icinga), Keynote, New Relic, Pingdom, Circonus, Red Gate SQL Monitor, Server Density, Zenoss, Monit, Munin, SolarWinds and many others. If it can send email, it will work with PagerDuty.;Native apps with push notifications- iOS and Android native apps with push notifications and a cross-platform mobile website ensure you can respond to alerts wherever you are, even on the go.;On-call duty scheduling- Easily set up schedules to fairly share on-call duty responsibilities with your team.;Automatic escalation of alerts- If you're paged but don't respond in time, the alert is auto-escalated to a team member. Ensures nothing slips through the cracks - ever.;Reliable, distributed architecture- PagerDuty's infrastructure is fully replicated in multiple data centers, with fast failover when problems occur.;Works internationally (Yes, really!)- Phone alerts can be delivered to over 170 countries and territories; SMS alerts are available virtually world-wide. (Is my country included?) | 14-day Free Trial for an unlimited number of hosts;200+ turn-key integrations for data aggregation;Clean graphs of StatsD and other integrations;Slice and dice graphs and alerts by tags, roles, and more;Easy-to-use search for hosts, metrics, and tags;Alert notifications via e-mail and PagerDuty;Receive alerts on any metric, for a single host or an entire cluster;Full API access in more than 15 languages;Overlay metrics and events across disparate sources;Out-of-the-box and customizable monitoring dashboards;Easy way to compute rates, ratios, averages, or integrals;Sampling intervals of 10 seconds;Mute all alerts with 1 click during upgrades and maintenance;Tools for team collaboration |
Statistics | |
Stacks 1.0K | Stacks 9.6K |
Followers 703 | Followers 8.2K |
Votes 119 | Votes 861 |
Pros & Cons | |
Pros
Cons
| Pros
Cons
|
Integrations | |

The world’s best software and DevOps teams rely on New Relic to move faster, make better decisions and create best-in-class digital experiences. If you run software, you need to run New Relic. More than 50% of the Fortune 100 do too.

Raygun gives you a window into how users are really experiencing your software applications. Detect, diagnose and resolve issues that are affecting end users with greater speed and accuracy.

AppSignal gives you and your team alerts and detailed metrics about your Ruby, Node.js or Elixir application. Sensible pricing, no aggressive sales & support by developers.

AppDynamics develops application performance management (APM) solutions that deliver problem resolution for highly distributed applications through transaction flow monitoring and deep diagnostics.

Stackify offers the only developers-friendly innovative cloud based solution that fully integrates application performance management (APM) with error and log. Allowing them to easily monitor, detect and resolve application issues faster

Skylight is a smart profiler for your Rails apps that visualizes request performance across all of your servers.

Librato provides a complete solution for monitoring and understanding the metrics that impact your business at all levels of the stack. We provide everything you need to visualize, analyze, and actively alert on the metrics that matter to you.

PM2 is a production process manager for Node.js applications with a built-in load balancer. It allows you to keep applications alive forever, to reload them without downtime and to facilitate common system admin tasks.

VictorOps is a real-time incident management platform that combines the power of people and data to embolden DevOps teams so they can handle incidents as they occur and prepare for the next one.

It is an AI-powered, full stack, automated performance management solution. It provides user experience analysis that identifies and resolves application performance issues faster than ever before.