Visão Geral
O curso Kafka Connect Fundamentals apresenta os principais conceitos, arquitetura, configuração e operação do Kafka Connect, a ferramenta oficial do Apache Kafka para integração de dados em larga escala. O aluno aprenderá como criar pipelines de dados confiáveis, conectar sistemas externos ao Kafka, configurar conectores, monitorar tarefas, lidar com erros e implementar boas práticas utilizadas no mercado.
Este curso combina teoria e prática em laboratório individual, permitindo que cada participante implemente e teste conectores em um ambiente real.
Conteúdo Programatico
1. Introduction to Kafka Connect
- What is Kafka Connect
- Key benefits and use cases
- Architecture overview
- Source vs. Sink connectors
2. Kafka Connect Architecture
- Standalone mode
- Distributed mode
- Workers and tasks
- Connect clusters
- Internal topics: config, offsets, status
3. Installing and Configuring Kafka Connect
- Installing Kafka Connect
- Worker configuration files
- Using Docker for Connect deployments
- REST API fundamentals
4. Working with Connectors
- Connector structure and lifecycle
- Creating connectors via REST API
- Updating and deleting connectors
- Connector plugins and plugin paths
5. Source Connectors
- How source connectors ingest data
- Common source connectors (JDBC, FilePulse, Debezium, etc.)
- Real-world source connector examples
6. Sink Connectors
- How sink connectors deliver data
- Common sink connectors (Elasticsearch, S3, JDBC, etc.)
- Real-world sink connector examples
7. Transformations (SMTs – Single Message Transformations)
- Built-in SMTs
- Applying transformations to source/sink connectors
- Custom SMT development overview
8. Data Schemas and Converters
- Schema Registry overview
- Avro, JSON, and Protobuf converters
- Schema evolution in Kafka Connect
9. Monitoring and Observability
- REST API monitoring
- Metrics and JMX
- Logging and debugging connectors
- Identifying and handling task failures
10. Error Handling and Retry Strategies
- Dead Letter Queue (DLQ)
- Retry logic
- Error tolerance settings
11. Security in Kafka Connect
- Authentication (SSL, SASL)
- Authorization (ACLs)
- Encrypting traffic and credentials
12. Scaling and High Availability
- Scaling Connect workers
- Distributed workloads
- Best practices for production deployments
13. Hands-on Labs
- Deploying a Connect cluster
- Creating a JDBC Source Connector
- Creating an Elasticsearch Sink Connector
- Applying SMT transformations
- Monitoring connectors using REST API
- Troubleshooting real scenarios
14. Best Practices and Architecture Patterns
- Recommended production configurations
- Connect cluster sizing
- Schema management strategy
- Designing reliable data pipelines