Overview
This course teaches, in a practical and comprehensive way, how to design, structure, and implement streaming pipelines using only Apache Flink SQL. You will learn to connect sources and sinks, create continuous transformations, handle event time, build windows, and deliver results in real time. It is ideal for anyone who wants to build professional pipelines without writing Java/Scala code, using only declarative SQL.
Course Content
Module 1 – Introduction to Streaming Pipelines
- What is a streaming pipeline?
- Why build pipelines using Flink SQL?
- Streaming vs batch data flows
- Architectural patterns for pipelines
Module 2 – Defining Sources and Sinks
- SQL DDL for Kafka sources
- JDBC connectors
- File and object storage connectors
- Configuring schemas, formats and watermarks
Module 3 – Transformations with SQL
- Filtering and projections
- Computed columns
- Enriching data with reference tables
- Handling changelogs
Module 4 – Time Semantics in Pipelines
- Event time vs processing time
- Configuring watermarks
- Late event handling
- Practical strategies for time-sensitive streams
Module 5 – Building Windowed Pipelines
- Tumbling, hopping and cumulative windows
- Session windows
- Window aggregations
- Multi-stage windowed transformations
Module 6 – Joining Streams and Tables
- Stream–stream joins
- Temporal joins
- Lookup table joins
- Performance considerations for large joins
Module 7 – Multi-Step Pipeline Design
- Designing multi-hop pipelines
- Chaining transformations across stages
- Creating reusable intermediate tables
- Ensuring consistency between pipeline layers
Module 8 – End-to-End Pipeline Implementation
- Kafka → Flink SQL → Kafka
- Multi-sink pipelines (Kafka + database + dashboard)
- Real-world pipeline walkthroughs
- Error handling and reliability patterns
Module 9 – Optimization and Performance
- Query plan analysis with EXPLAIN
- Reducing backpressure
- Memory and state tuning
- Scalability strategies
Module 10 – Production Best Practices
- Deploying SQL pipelines in production environments
- Schema evolution and governance
- Monitoring with metrics and logs
- Operational guidelines for stable pipelines