Visão Geral
O Curso Kafka Schema Registry and Avro foi desenvolvido para profissionais que desejam dominar a serialização de dados e o gerenciamento de esquemas em ambientes Apache Kafka.
Durante o treinamento, os participantes aprenderão a trabalhar com o Confluent Schema Registry, utilizar o formato Avro para troca eficiente de mensagens e garantir a compatibilidade e integridade dos dados entre produtores e consumidores.
O curso combina conceitos teóricos, exemplos práticos e laboratórios reais, permitindo ao aluno construir pipelines de dados consistentes e evolutivos.
Conteúdo Programatico
Module 1: Introduction to Schema Management
- Why schema management is critical in data streaming
- Common serialization formats: JSON, Avro, Protobuf
- Overview of Confluent Schema Registry
- Schema evolution and compatibility types
Module 2: Understanding Avro
- What is Apache Avro and how it works
- Avro data structure: schema, record, field types
- Binary serialization vs. JSON serialization
- Advantages of Avro in Kafka pipelines
Module 3: Working with Confluent Schema Registry
- Schema Registry architecture and components
- Schema subjects and versioning
- Registering, retrieving, and deleting schemas
- Using Schema Registry REST API
Module 4: Producing and Consuming Avro Messages
- Configuring Avro producers and consumers
- Schema-aware serialization and deserialization
- Error handling and fallback strategies
- Hands-on: building a producer/consumer app using Avro
Module 5: Schema Compatibility and Evolution
- Compatibility modes: backward, forward, full, none
- Managing schema evolution and version control
- Handling schema breaking changes
- Best practices for maintaining data compatibility
Module 6: Advanced Topics and Integrations
- Working with Protobuf and JSON Schema
- Integrating Schema Registry with Kafka Connect
- Using Avro in stream processing (Kafka Streams, ksqlDB)
- Monitoring and securing Schema Registry
Module 7: Hands-On Project
Project: Build a complete Kafka data pipeline using Avro and Schema Registry to serialize, store, evolve, and consume data across multiple applications with backward and forward compatibility.