Visão Geral
O Curso Data Engineering with Snowflake oferece uma formação completa para profissionais que desejam projetar, desenvolver e otimizar pipelines de dados na plataforma Snowflake Data Cloud.
Durante o treinamento, o participante aprenderá desde os conceitos fundamentais de arquitetura e modelagem de dados até a implementação prática de pipelines de ingestão, transformação, integração com ferramentas externas e otimização de performance.
O curso inclui laboratórios práticos, abordando casos reais de engenharia de dados corporativa, garantindo uma experiência imersiva e aplicável ao ambiente de produção.
Conteúdo Programatico
Module 1: Introduction to Snowflake and Data Cloud Concepts
- Overview of Snowflake architecture and key components
- Virtual Warehouses, Databases, Schemas, and Stages
- Cloud-agnostic design: AWS, Azure, GCP
- Snowflake services layer, compute layer, and storage layer
- Separation of compute and storage explained
Module 2: Snowflake Setup and Environment Configuration
- Creating and managing Snowflake accounts
- Understanding roles, users, and access controls
- Working with the Snowflake Web UI, Snowsight, and SnowSQL
- Configuring warehouses and resource monitors
- Integration with cloud storage (S3, Azure Blob, GCS)
Module 3: Data Loading and Unloading
- File staging (internal and external stages)
- COPY INTO and PUT commands
- Working with structured and semi-structured data (JSON, Parquet, Avro)
- Data unloading and exporting best practices
- Monitoring data load operations
Module 4: Data Modeling and Schema Design
- Star and Snowflake schema design patterns
- Fact and dimension tables
- Time-travel and zero-copy cloning for version control
- Data retention, cloning, and fail-safe mechanisms
- Handling schema evolution
Module 5: Data Transformation and Pipeline Design
- Building transformation pipelines with SQL and Snowflake Streams
- Tasks and scheduling automation
- Using Snowpark for Python and Java
- Data transformation using dbt (data build tool)
- Real-time data processing with Snowpipe
Module 6: Performance Tuning and Query Optimization
- Query profiling using Query History and Query Profile
- Warehouse scaling and caching strategies
- Clustering keys and micro-partitioning
- Query result reuse and materialized views
- Identifying and resolving performance bottlenecks
Module 7: Security and Governance
- Role-Based Access Control (RBAC) in Snowflake
- Network policies and encryption at rest/in transit
- Data masking and row access policies
- Monitoring user activities and resource usage
- Integration with identity providers (Azure AD, Okta, etc.)
Module 8: Cost Management and Optimization
- Understanding Snowflake credit usage and billing
- Query cost analysis and warehouse right-sizing
- Auto-suspend and auto-resume features
- Using Resource Monitors for cost governance
- Best practices for cost-efficient pipelines
Module 9: Integrations and Ecosystem
- Connecting Snowflake with Power BI, Tableau, and Looker
- Integration with Apache Airflow for orchestration
- ETL/ELT integration with Azure Data Factory, Matillion, and Informatica
- Working with Kafka and Snowpipe for streaming ingestion
- Exporting data to data lakes or external systems
Module 10: Hands-on Labs
- End-to-end pipeline development in Snowflake
- Automating data ingestion from cloud storage
- Query performance tuning exercises
- Implementing row-level security and masking policies
- Cost optimization simulation in a production-like scenario
Module 11: Best Practices and Real-World Scenarios
- Enterprise data architecture using Snowflake
- Multi-cluster warehouses for concurrency scaling
- Data sharing between organizations (Data Exchange)
- Migration strategies from on-premise databases to Snowflake
- Snowflake deployment lifecycle management
Module 12: Capstone Project
- Design, build, and optimize a complete Snowflake data engineering solution
- Integration with BI dashboard and external storage
- Performance benchmarking and documentation
- Instructor-led review and feedback session
Module 13: Advanced Topics and Future Trends
- Working with Snowflake Native Apps and Data Marketplace
- Streamlit and Snowpark for Data Applications
- Using AI/ML integrations within Snowflake
- Preview features: Dynamic tables, Unistore, and Cortex AI
- Future of cloud-based data engineering