Datadog

DevOps / Monitoring / Performance Monitoring

Paurush Rai

Full Stack Developer at Fuelbuddy·Feb 19, 2024

Needs advice

Datadog

Sentry

and

Stackdriver

Need advice on this.

Which one should I use for logging and error monitoring ( Datadog / Sentry / Stackdriver )?

Open to any other solutions.

4 upvotes·34.4K views

Replies (1)

Amir Mehler

DevOps Manager at AI21 Labs·Mar 2, 2024

Stackdriver is no longer a product on it's own. It's the "Monitoring" feature in Google cloud. If you're on Google cloud - it's the easiest way to set up logging.
Datadog is one of the 3 most expensive tools for logging, It's a SaaS product like Splunk and New Relic. Only use them if you have high budget and consider your logs at least as holy as the bible.
Sentry is not related to logging, but it can report errors and crashes in your application. Mixing up those tools tells me you're not really familiar with monitoring in general. I suggest learning a bit more, it's easy to get monitoring wrong and ending up shipping and storing masses of useless logs. Good luck.

3 upvotes·129 views

Needs advice

and

We wanted to have a tool that has both APM and error monitoring stuff included in one.

I preferred to go to New Relic as the free version offers 100gb storage but have seen some "not good" comments about it compared to Datadog.

My teammates suggested AppSignal.

So which one should be a good option to give a try?

Tech Stach -> Ruby on Rails, Rails-react, Amazon EC2 machine.

5 upvotes·33.8K views

Needs advice

and

Hello :) We are using Datadog on Kong to monitor the metrics and analytics.

We feel that the cost associated with Datadog is high in terms of custom metrics and indexations. So, we planned to find an alternative for Datadog and we are looking into Grafana implementation with kong.

Will the shift from Datadog to Grafana be a wise move and flawless?

5 upvotes·66.8K views

Replies (1)

Giorgio Torres

Software Engineer ·Jun 9, 2023

Recommends

Grafana

Prometheus

My Stack

Hi Sathish,

Straightforward responding you: Will the shift from Datadog to Grafana be a wise move? R: Maybe, specially in the long run.

and flawless? R: It depends, but I don't think so. And it also won't be so smooth. You'll reach a point of Datatog deprec where your team will be supporting both, and at this peak will be when you'll think it was a bad move, because you'll be paying for Datadog and Prometheus/Grafana servers, plus your team will have to work on new system metrics and fine tuning metric queries and graphs on Grafana, so you can expect a delay on business deliveries. Your team will feel overburdened.

So bear in mind that basically you'll be moving from an out of the box monitoring solution to your own. The analogy I'd do is to moving out from a cloud provider to your own self managed servers, though way less complex of course.

Grafana itself won't do the trick. You'll need a data scrapper system like Prometheus to collect metrics/data from your app, or use a middleware Pushgateway to receive these data so Prometheus can scrape them. To do this you'll need some backend work to expose metrics data, instead of HTTP post your metrics data to Datadog, so you can expect a little re-architecture or engineering on that, depending on how your system is designed.

In the long run, you'll probably need a team focused only on that: to take care of the Prometheus/Grafana updates as well as their servers (after all they're still systems that need to be managed).

I think all of this will depend on the size of your company now and your teams (which I advise you to include them on that decision), and your budget plus rush for business deliveries. In the long run it'll pay, for your company will be more specialized on this stack, and you can have a team focused on that plus developer support.

If you wanna go with it, I'd suggest to do a small PoC (which still will need a good amount of work). Select a project (or even an endpoint/ mobile feature) to have both integrations: Datadog plus Grafana, and do the same graphs you have on Datadog to Grafana. I think this way you can have a better picture of the reality on how that will be for your company.

1 upvote·1 comment·1.3K views

Giorgio Torres

June 9th 2023 at 9:46AM

I also posted your question on ChatGPT, and here's its full response on that, which I also agree:

```Shifting from Datadog to Grafana for monitoring metrics and analytics can be a reasonable alternative, depending on your specific requirements and use case. Grafana is a popular open-source platform that provides rich visualization and monitoring capabilities. However, it's important to note that Grafana and Datadog have different feature sets and focus areas. Here are a few considerations to help you evaluate the potential shift:

1. Feature Comparison: Evaluate the features and functionalities offered by Grafana and compare them to the ones you currently use in Datadog. Ensure that Grafana meets your specific monitoring, analytics, and visualization needs.

2. Custom Metrics and Indexing: Understand your requirements for custom metrics and indexing in Grafana. Ensure that Grafana provides the necessary flexibility and support for the metrics and indexes you require. Check if Grafana integrates well with Kong and offers plugins or extensions for capturing custom metrics.

3. Learning Curve: Consider the learning curve associated with Grafana, as your team will need to become familiar with its interface, configuration, and query language (e.g., Prometheus Query Language). Assess whether your team has the required expertise or if additional training will be necessary.

4. Data Storage and Scalability: Evaluate how Grafana handles data storage and scalability. Determine if it can handle the volume of metrics and analytics data generated by Kong and if it integrates seamlessly with your existing data storage solutions.

5. Alerting and Notification: Determine if Grafana's alerting and notification features meet your requirements. Check if it supports real-time alerting, customizable thresholds, and various notification channels (e.g., email, Slack).

6. Ecosystem and Community Support: Consider the ecosystem and community support around Grafana. It has a large user community, which means you can find resources, plugins, and community-contributed dashboards to enhance your monitoring capabilities.

7. Cost Comparison: Evaluate the cost aspect of Grafana compared to Datadog. While Grafana itself is open source, you need to consider the costs associated with additional components, such as data storage, alerting systems, and any necessary infrastructure changes. Calculate the total cost of ownership and compare it to your current expenses with Datadog.

It's important to conduct a thorough evaluation and possibly run a proof of concept or pilot project to ensure that Grafana meets your specific needs before fully transitioning from Datadog.```

Needs advice

and

We are evaluating an APM tool and would like to select between AppDynamics or Datadog. Our applications are largely hosted on Microsoft Azure but we would keep the option to move to AWS or Google Cloud Platform in the future.

In addition to core Azure services, we will be hosting other components - including MongoDB, Keycloak, PagerDuty, etc. Our applications are largely C# and React-based using frontend for Backend patterns and Azure API gateway. In addition, there are close to 50+ external services integrated using both REST and SOAP.

4 upvotes·1M views

Greg Smethells

CTO and Software Architect at Medstrat·May 9, 2022

Needs advice

and

(

)

Looking for the most straightforward APM tool to use with a Python CRUD stack. We would like to have flame graphs of individual transactions. We'd also like to see SRE tools for monitoring, alerting, and incident response. We are leaning towards Datadog. Thoughts?

2 upvotes·29.6K views

Nikola Tokic

Software Engineer ·Mar 2, 2022

Needs advice

Flask

connexion

and

Quart

I'm considering moving from Flask to Quart, does anyone have some experience with this migration?

I expect possible problems with connexion which we use as OpenAPI specification.

Would be good if someone can point downsides of moving to the Quart framework so I can double-check if my plan is worth doing.

Other libs and tools used in the project: SQLAlchemy, alembic, PostgreSQL, Datadog

cons for now:

Refactoring uncertainty (not sure how big of a task is it)
Connexion might not work with Quart (moving to another library)
...

8 upvotes·151.8K views

Needs advice

and

We are migrating from New Relic to Datadog... Is there a way I can export all existing alerts in an easy way from New Relic to Datadog?

3 upvotes·33.7K views

Dewald Swanepoel

Software Engineer ·Apr 8, 2021

Needs advice

Datadog

and

StatsD

I see StatsD is commonly used in conjunction with Datadog. In fact, Datadog even has their own StatsD daemon (called DogStatsD) embedded in the DataDog agent. Can someone explain to me what it is that StatsD gives you which you don't already have with Datadog's APM and distributed tracing functionality?

4 upvotes·51.7K views

Replies (1)

Frédéric MARAND

Core Developer at OSInet·Apr 18, 2021

The Datadog statsd agent is not really a normal statsd client: it implements a large subset of the original (etsy) features, but also some Datadog-specific features (about histograms). It is used to send metrics to Datadog APM, and its big advantage is the developer experience, which is familiar and easy to use, just like any statsd client, making it trivial to replace an existing statsd metrics client in any application with the Datadog version to publish metrics to Datadog.

1 upvote·2.3K views

Maurice Ruff

Cloud Architect ·Mar 18, 2021

Needs advice

Amazon CloudWatch

Datadog

and

New Relic

We build everything in AWS around microservices and are looking at Amazon CloudWatch, Datadog, and New Relic. Which one would work best for our situation?

3 upvotes·34.1K views

Replies (3)

Anders Turner

Mar 25, 2021

Recommends

Splunk

Via acquisitions and internal product developments over the last year+, Splunk provides really differentiated APM and monitoring for microservices and AWS. I'd recommend giving it a peak if you haven't yet! For some validation, a recent Cloud Observability vendor report by GigaOm came out and ranked Splunk as the "top performer" in the space. Hope this helps in your search

DevOps Observability, Analysis and Insight | Splunk (splunk.com)

3 upvotes·818 views

Justin Horine

Commercial Business Developmen at Datadog·Mar 24, 2021

Recommends

Datadog

Maurice - Datadog allows you to monitor microservices by unifying metrics, traces, and logs all in one place. Enabling you to visualize how services interact at a glance so you can quickly locate problems and begin troubleshooting. On top of that you can monitor complex dependencies, investigate possible points of failure, and set smart alerts for proactive monitoring. In my opinion Datadog offers the most comprehensive tool with the simplest UI.

2 upvotes·554 views

View all (3)

Kamil Kowalski

Lead Architect at Fresha·Feb 8, 2021

Migrated

from

(

)

Coming from a Ruby background, we've been users of New Relic for quite some time. When we adopted Elixir, the New Relic integration was young and missing essential features, so we gave AppSignal a try. It worked for quite some time, we even implemented a :telemetry reporter for AppSignal . But it was difficult to correlate data in two monitoring solutions, New Relic was undergoing a UI overhaul which made it difficult to use, and AppSignal was missing the flexibility we needed. We had some fans of Datadog, so we gave it a try and it worked out perfectly. Datadog works great with Ruby , Elixir , JavaScript , and has powerful features our engineers love to use (notebooks, dashboards, very flexible alerting). Cherry on top - thanks to the Datadog Terraform provider everything is written as code, allowing us to collaborate on our Datadog setup.

3 upvotes·245.3K views