Visão Geral
Este Curso Apache Flink Foundation apresenta os fundamentos do Apache Flink, uma plataforma moderna e distribuída para processamento de dados em tempo real e em batch. Você aprenderá como o Flink funciona, seus componentes principais, como desenvolver aplicações de streaming e como integrá-lo a ecossistemas de dados corporativos.
Conteúdo Programatico
Module 1 – Introduction to Apache Flink
-
What is Apache Flink?
- Batch vs Stream processing
- Core concepts and terminology
- Flink ecosystem overview
Module 2 – Flink Architecture
-
Distributed runtime
- JobManager and TaskManager roles
- Parallelism and scaling
- Checkpoints and state management
Module 3 – Development Environment Setup
-
Installing Flink locally
- Project structure and dependencies
- Working with Flink APIs (Java, Python, Scala)
Module 4 – DataStream API
-
Streams and transformations
- Operators, functions, and windows
- Event time vs processing time
- Stateful computations
Module 5 – DataSet API (Batch Processing)
-
DataSet creation
- Transformations
- Execution plans
- When to prefer batch or streaming
Module 6 – Connectors and Integrations
-
Kafka, Kinesis, RabbitMQ
- File systems (HDFS, S3, local)
- JDBC, NoSQL, and data sinks
- Using Flink SQL
Module 7 – Deployment & Monitoring
-
Running jobs on local, standalone, and cluster modes
- Flink Dashboard overview
- Logs, job metrics, and health monitoring
- Introduction to Flink on Kubernetes
Module 8 – Optimization & Best Practices
-
Checkpoint tuning
- Memory and performance optimization
- Handling backpressure
- Fault tolerance strategies
Module 9 – Hands-On Project
-
Designing a real-time streaming pipeline
- Integrating with Kafka
- Processing real-time events
- Deploying a complete streaming job