Overview
This course, Full stack observability with Grafana with Prometheus, covers full stack observability using Prometheus for metrics collection and storage and Grafana for visualization, analysis, and data correlation. It focuses on modern environments built on cloud, containers, Kubernetes, and microservices, and covers metrics, logs, traces, and alerts. The course moves from fundamentals to advanced practices in monitoring, troubleshooting, and SRE, using exporters, PromQL, dashboards, alerting, and integration with distributed systems.
Course Content
Module 1: Observability Fundamentals
- Observability vs Monitoring
- Metrics, Logs and Traces
- Golden Signals and RED Method
- SLIs, SLOs and SLAs
Module 2: Prometheus Architecture
- Prometheus internal architecture
- Time-series data model
- Scraping and service discovery
- Pull vs Push model
Module 3: Instrumenting Applications
- Application metrics fundamentals
- Client libraries overview
- Instrumenting backend services
- Instrumenting frontend applications
Module 4: Exporters and Infrastructure Monitoring
- Node Exporter
- Blackbox Exporter
- Database exporters
- Custom exporters
Module 5: PromQL in Depth
- PromQL data types
- Instant and range vectors
- Aggregation and functions
- Advanced PromQL patterns
Module 6: Grafana Fundamentals
- Grafana architecture
- Data sources configuration
- Dashboard basics
- Visualization types
Module 7: Advanced Grafana Dashboards
- Dashboard design best practices
- Variables and templating
- Annotations and correlations
- Performance optimization
Module 8: Alerting and Incident Response
- Alertmanager architecture
- Alert rules and routing
- Alert fatigue and best practices
- Incident response workflows
Module 9: Kubernetes Observability
- Kubernetes metrics overview
- kube-state-metrics
- Monitoring pods, nodes and services
- Capacity planning and autoscaling
Module 10: Distributed Systems and Tracing
- Introduction to distributed tracing
- Metrics and traces correlation
- Latency and dependency analysis
- Troubleshooting distributed systems
Module 11: Full Stack Observability Use Cases
- Monitoring microservices architectures
- Frontend performance monitoring
- Database and cache observability
- End-to-end request visibility
Module 12: Observability at Scale and SRE Practices
- High availability Prometheus setups
- Long-term storage and federation
- Observability for SRE teams
- Production best practices