Visão Geral
O Curso Datadog APM Fundamentals introduz os conceitos principais de Application Performance Monitoring (APM) utilizando a plataforma Datadog.
Os participantes aprenderão a instalar, configurar e utilizar o Datadog APM para monitorar aplicações, identificar gargalos, analisar desempenho, rastrear requisições distribuídas e aplicar práticas de observabilidade modernas em ambientes cloud, containers e microservices.
Conteúdo Programatico
Module 1: Introduction to Datadog APM
- What is Datadog and its observability ecosystem
- APM fundamentals and key concepts
- Use cases for distributed tracing
- Overview of APM features and capabilities
Module 2: Installing and Configuring the Datadog Agent
- Understanding the Datadog Agent
- Installing the agent on Linux, Windows, Docker, and Kubernetes
- Configuring APM and enabling tracing
- Validating APM data ingestion
Module 3: Instrumenting Applications for Tracing
- Automatic instrumentation for supported languages
- (Java, Python, Node.js, Go, .NET, Ruby, PHP)
- Manual instrumentation with Datadog SDK
- Tagging and enriching spans
- Handling sampling, retention and configuration settings
Module 4: Distributed Tracing Essentials
- How traces and spans work
- Understanding latency, errors, and performance metrics
- Viewing and analyzing trace details
- Trace search and analytics
- End-to-end transaction visibility
Module 5: Service Map and Dependencies
- Introduction to Service Map
- Visualizing service dependencies
- Identifying bottlenecks and slow services
- Analyzing upstream and downstream impacts
Module 6: Metrics, Logs, and Trace Correlation
- Correlating logs, metrics, and traces
- Enabling log injection
- Unified observability concepts
- Troubleshooting cross-service issues
Module 7: Datadog Continuous Profiler
- Introduction to code profiling
- CPU, memory, I/O and lock contention analysis
- Finding hotspots in application code
- Performance optimization workflows
Module 8: Dashboards, Alerts, and SLOs
- Creating custom dashboards for APM monitoring
- Using monitors to detect anomalies, latency spikes, and errors
- Defining SLIs and SLOs for applications
- Best practices for alerting and noise reduction
Module 9: Best Practices and Common Pitfalls
- Optimizing tracing performance and cost
- Handling high throughput environments
- Security considerations and data protection
- Real-world troubleshooting scenarios
Module 10: Capstone Project
- Instrumenting a multi-service application
- Analyzing performance with APM, Logs, Metrics, Service Map and Profiler
- Creating dashboards and alerts
- Presenting final observability results