DevOps and Site Reliability Engineering Course

24 hours
Overview

This course combines the core principles of DevOps and Site Reliability Engineering (SRE), giving students a comprehensive understanding of how development and operations teams can collaborate to improve efficiency, reduce failures, and increase system reliability. It covers topics ranging from infrastructure automation to monitoring, alerting, and SRE practices for ensuring the availability and scalability of critical applications.

Objectives

After completing the DevOps and Site Reliability Engineering course, you will be able to:

  • Understand the culture and principles of DevOps and SRE
  • Implement CI/CD pipelines to automate builds and deployments
  • Manage infrastructure as code using tools such as Terraform and Ansible
  • Monitor and maintain system reliability in production environments
  • Use automation techniques to improve service availability
  • Configure monitoring and alerting systems for early failure detection
  • Apply reliability engineering practices such as SLOs, SLIs, and SLAs
  • Work with observability and logging tools for performance analysis
Target Audience
  • DevOps Engineers
  • Site Reliability Engineers
  • Software Developers
  • System Administrators
  • Solutions Architects
  • Professionals interested in improving reliability and automation in IT environments
Prerequisites
  • Familiarity with operating systems (Linux/Windows)
  • Basic knowledge of cloud infrastructure
  • Experience with automation and monitoring tools
Materials
English/Portuguese/Hands-on Lab
Course Outline

Module 1: Introduction to DevOps and SRE

  1. What is DevOps? Cultural and Technical Aspects
  2. What is Site Reliability Engineering (SRE)?
  3. The Relationship Between DevOps and SRE
  4. Key Principles of SRE: Reliability, Scalability, and Automation

Module 2: CI/CD Pipeline Automation

  1. Introduction to Continuous Integration and Continuous Deployment (CI/CD)
  2. Tools for CI/CD: Jenkins, GitLab CI, CircleCI, etc.
  3. Creating Pipelines for Automated Builds and Deployments
  4. Best Practices for Managing CI/CD Pipelines

Module 3: Infrastructure as Code (IaC)

  1. Introduction to IaC and Its Benefits in DevOps
  2. Working with Terraform for Infrastructure Automation
  3. Automating Configuration Management with Ansible
  4. Case Studies: Implementing IaC in Cloud Environments

Module 4: Containerization and Orchestration

  1. Introduction to Docker and Containers
  2. Orchestrating Containers with Kubernetes
  3. Managing Kubernetes Clusters: Deployment, Scaling, and Monitoring
  4. Case Study: Deploying Microservices with Kubernetes

Module 5: Monitoring, Alerting, and Observability

  1. Importance of Monitoring in DevOps and SRE
  2. Tools for Monitoring: Prometheus, Grafana, and ELK Stack
  3. Setting Up Alerts for Proactive Monitoring
  4. Logging and Observability: Analyzing System Performance and Health

Module 6: SRE Concepts and Practices

  1. Defining Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
  2. Error Budgets and Their Role in Managing Reliability
  3. Implementing Redundancy and Failover Mechanisms
  4. Balancing Reliability with Feature Velocity

Module 7: Automating Operations and Incident Response

  1. Automating System Administration Tasks
  2. Tools for Automating Incident Response (PagerDuty, Opsgenie)
  3. Creating Playbooks for Handling Incidents
  4. Post-Incident Reviews: Learning from Failures

Module 8: Scaling Infrastructure for Reliability

  1. Designing Systems for Scalability
  2. Horizontal vs. Vertical Scaling
  3. Load Balancing and Traffic Distribution
  4. Auto-scaling with Kubernetes and Cloud Providers

Module 9: Security and Compliance in DevOps

  1. Integrating Security into DevOps Pipelines (DevSecOps)
  2. Automating Security Testing in CI/CD Pipelines
  3. Compliance and Governance in DevOps
  4. Managing Secrets and Sensitive Data

Module 10: High Availability and Disaster Recovery

  1. Designing Systems for High Availability
  2. Backup and Disaster Recovery Planning
  3. Implementing Active-Passive and Active-Active Architectures
  4. Testing and Validating Disaster Recovery Plans

Module 11: Advanced SRE Techniques

  1. Chaos Engineering: Testing System Resilience
  2. Distributed Systems: Ensuring Reliability Across Multiple Datacenters
  3. Reducing Toil with Automation and Self-Healing Systems
  4. Continuous Improvement in SRE Practices

Module 12: Final Project and Case Studies

  1. Real-world Case Study: Implementing DevOps and SRE in an Enterprise
  2. Final Project: Building a CI/CD Pipeline with Reliability Practices
  3. Course Summary and Best Practices
  4. Q&A and Discussion on Future Trends in DevOps and SRE
