Stream Processing Architectures with Apache Kafka Streams Course

24 hours
Overview

This course is an in-depth introduction to stream processing architectures with Apache Kafka Streams, covering the fundamental concepts, best practices, and essential techniques for building and managing real-time data pipelines. Aimed at data professionals, software engineers, and solution architects, it explores the components and features that make Kafka Streams a powerful tool for turning large volumes of data into actionable information. Participants will learn to design, develop, and monitor distributed, scalable, and resilient data pipelines, enabling their organizations to process and respond to events in real time efficiently.

Objectives

After completing this Stream Processing Architectures with Apache Kafka Streams course, you will be able to:

  • Understand the fundamentals of stream processing and the role of Kafka Streams in event-driven architectures
  • Configure and operate a Kafka Streams application for real-time data processing
  • Build scalable and resilient data pipelines with Apache Kafka Streams
  • Integrate Kafka Streams with other data tools and platforms
  • Monitor, optimize, and troubleshoot data pipelines in production environments
Target Audience
  • Data Engineers
  • Software Developers
  • Solution and Systems Architects
  • IT professionals interested in real-time data processing
  • Apache Kafka specialists looking to deepen their knowledge of Kafka Streams
Materials
English + Exercises + Hands-On Lab
Course Outline

Module 1: Introduction to Stream Processing and Apache Kafka Streams

  1. Overview of Stream Processing Architectures
  2. Key Concepts in Real-Time Data Processing
  3. Understanding Apache Kafka and Kafka Streams
  4. Benefits and Limitations of Kafka Streams
  5. Apache Kafka Streams Use Cases and Applications

Module 2: Setting Up the Kafka Streams Environment

  1. Installing and Configuring Apache Kafka
  2. Kafka Streams API Setup
  3. Development Environment Configuration (Java/Scala)
  4. Managing Kafka Clusters and Partitions for Streaming

Module 3: Core Concepts of Kafka Streams

  1. Understanding Streams and Tables
  2. Kafka Streams DSL (Domain-Specific Language)
  3. Stream Processing Topology
  4. State Stores and Local Storage in Kafka Streams
  5. Stateless vs. Stateful Transformations

Module 4: Working with Kafka Streams API

  1. Using the StreamsBuilder API
  2. Implementing Common Transformations: Map, Filter, and Join
  3. Windowing Operations: Tumbling, Hopping, and Sliding Windows
  4. Aggregating and Reducing Stream Data
  5. Managing State and Querying State Stores

Module 5: Error Handling, Retries, and Exactly-Once Processing

  1. Fault Tolerance and Error Handling in Kafka Streams
  2. Configuring Retries and Backoff Strategies
  3. Processing Guarantees: At-Least-Once vs. Exactly-Once
  4. Transactions and Exactly-Once Semantics in Kafka Streams

Module 6: Advanced Stream Processing Techniques

  1. Joining Streams with KStream-KTable and KTable-KTable Joins
  2. Complex Event Processing (CEP) Patterns
  3. Using Kafka Streams Processor API for Custom Logic
  4. Integrating Machine Learning Models in Kafka Streams

Module 7: Integrating Kafka Streams with External Systems

  1. Kafka Connect for Data Integration
  2. Integrating Kafka Streams with NoSQL Databases (e.g., MongoDB, Cassandra)
  3. Connecting Kafka Streams to Relational Databases
  4. Stream Processing with Spark Streaming and Flink

Module 8: Performance Tuning and Optimization

  1. Resource Allocation and Configurations for High Performance
  2. Optimizing State Store Management
  3. Strategies for Scaling Kafka Streams Applications
  4. Monitoring Kafka Streams with Prometheus and Grafana
  5. Best Practices for Stream Processing in Production Environments

Module 9: Security and Access Control in Kafka Streams

  1. Securing Data in Transit and Data at Rest
  2. Configuring Authentication and Authorization
  3. Implementing SSL and SASL for Kafka Streams Applications
  4. Access Control Policies and Permissions

Module 10: Case Study and Practical Applications

  1. Real-World Implementation of Kafka Streams in Different Industries
  2. End-to-End Pipeline Design: Data Ingestion, Processing, and Visualization
  3. Hands-On Lab: Building and Deploying a Stream Processing Solution
  4. Review of Best Practices and Common Pitfalls