Overview
This course teaches you how to run, manage, and scale Kafka Connect with Docker and Kubernetes to build modern, resilient data pipelines. It covers everything from containerization and orchestration to production best practices, including observability, security, and automation.
Course Syllabus
Module 1 — Introduction to Kafka Connect in Containerized Environments
- Why run Kafka Connect in containers
- Overview of Docker + Kubernetes for data engineering
- Kafka Connect architecture refresher
Module 2 — Running Kafka Connect with Docker
- Dockerizing Kafka Connect
- Creating Docker Compose environments
- Managing plugins in Docker images
- Deploying connectors through the REST API
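As a preview of the REST API topic in this module, the following is a minimal sketch of how a connector definition might be assembled before it is submitted to a Connect worker. The helper name `build_connector_payload`, the connector name `demo-file-source`, the file path, and the topic name are all hypothetical and chosen only for illustration. The connector class is the FileStream source connector that ships with Apache Kafka for demo purposes.

```python
import json


def build_connector_payload(name, connector_class, tasks_max, extra_config):
    """Build the JSON body expected by Kafka Connect's POST /connectors endpoint.

    Connector configuration values are strings, so every value is
    converted with str() before it is placed in the "config" map.
    """
    config = {
        "connector.class": connector_class,
        "tasks.max": str(tasks_max),
    }
    config.update({key: str(value) for key, value in extra_config.items()})
    return {"name": name, "config": config}


# Hypothetical example values; adjust them to your own environment.
payload = build_connector_payload(
    name="demo-file-source",
    connector_class="org.apache.kafka.connect.file.FileStreamSourceConnector",
    tasks_max=1,
    extra_config={"file": "/tmp/demo.txt", "topic": "demo-topic"},
)

body = json.dumps(payload, indent=2)
print(body)
```

The resulting body would typically be sent as an HTTP POST with `Content-Type: application/json` to the `/connectors` path of a worker, which listens on port 8083 by default. The host name depends on your Docker Compose or Kubernetes setup.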
Module 3 — Advanced Docker Configurations
- Custom Dockerfiles for Connect workers
- Persistent volumes for offsets, configs & status
- Networking considerations
- Managing connector plugins at scale
Module 4 — Introduction to Kubernetes for Kafka Connect
- Core K8s concepts for data engineers
- Deployments, StatefulSets, Services
- ConfigMaps & Secrets
- Horizontal Pod Autoscaler basics
Module 5 — Deploying Kafka Connect on Kubernetes
- Creating Kafka Connect Deployment YAML
- Plugin management in Kubernetes
- Configuring distributed mode
- REST API access inside the cluster
Module 6 — Scaling Kafka Connect in Kubernetes
- Horizontal scaling with multiple workers
- Task assignment and rebalancing
- Failure handling & self-healing
- Best practices for resource requests/limits
Module 7 — Observability and Monitoring
- Logging strategies
- Metrics with Prometheus
- Dashboards with Grafana
- Alerting for connector failures
Module 8 — Security for Containerized Kafka Connect
- Securing API endpoints
- SSL/TLS configuration
- Secret management with Kubernetes Secrets
- Role-based access control (RBAC)
Module 9 — CI/CD and Automation
- Automating connector deployment with GitOps
- Using ArgoCD or Flux
- Automated image builds for Connect workers
- Rolling updates and versioned connectors
Module 10 — Integrating with Cloud Providers
- Running Kafka Connect on EKS, GKE, AKS
- Using cloud-managed Kafka services
- Best practices for hybrid cloud configurations
Module 11 — Real-World Streaming Pipelines
- Database → Kafka → Data Warehouse pipeline
- Managing schema evolution in Kubernetes
- Error handling, DLQs, and retry strategies
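To make the error-handling topic above more concrete, here is a small sketch of how dead letter queue (DLQ) and retry settings might be layered onto a sink connector configuration. The helper name `with_dead_letter_queue`, the topic names, the file path, and the timeout values are hypothetical. The `errors.*` keys are standard Kafka Connect properties, and the DLQ properties apply to sink connectors only.

```python
def with_dead_letter_queue(sink_config, dlq_topic,
                           retry_timeout_ms=60000, max_delay_ms=5000):
    """Return a copy of sink_config with DLQ and retry settings added.

    The input dictionary is not modified.
    """
    merged = dict(sink_config)
    merged.update({
        # Keep the task running when a record fails instead of stopping it.
        "errors.tolerance": "all",
        # Route failed records to this topic for later inspection.
        "errors.deadletterqueue.topic.name": dlq_topic,
        # Attach failure context (for example the exception) as record headers.
        "errors.deadletterqueue.context.headers.enable": "true",
        # Retry failed operations for up to this many milliseconds.
        "errors.retry.timeout": str(retry_timeout_ms),
        # Upper bound on the delay between consecutive retries.
        "errors.retry.delay.max.ms": str(max_delay_ms),
    })
    return merged


# Hypothetical base configuration for a demo sink connector.
base_config = {
    "connector.class": "org.apache.kafka.connect.file.FileStreamSinkConnector",
    "tasks.max": "1",
    "topics": "orders",
    "file": "/tmp/orders.txt",
}

dlq_config = with_dead_letter_queue(base_config, dlq_topic="orders-dlq")
for key in sorted(dlq_config):
    print(f"{key}={dlq_config[key]}")
```

Keeping these settings in a helper like this is one possible way to apply the same DLQ policy to several sink connectors. The right timeout and delay values depend on the target system and should be treated as placeholders.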
Module 12 — Capstone Project
- Build a full Kafka Connect cluster on Kubernetes
- Deploy a source and sink connector
- Apply monitoring, scaling, and security
- Final documentation & presentation