Overview
The Exploratory Data Analysis in AWS course teaches how to perform exploratory data analysis (EDA) using the main AWS services and tools. Participants learn to collect, clean, transform, and visualize data to generate initial insights that support predictive modeling and data-driven decision-making.
Course Outline
Module 1: Introduction to Exploratory Data Analysis (EDA)
- Understanding the importance of EDA
- Common techniques and objectives of EDA
- AWS services for data exploration and visualization
Module 2: Data Collection and Storage
- Storing and organizing data in Amazon S3
- Managing data formats (CSV, Parquet, JSON)
- Using AWS Glue Data Catalog for metadata management
Module 3: Data Preparation and Cleaning
- Data extraction and transformation with AWS Glue
- Handling missing values and duplicates
- Formatting and structuring data for analysis
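As a rough illustration of the cleaning steps in this module, the sketch below removes exact duplicate rows and fills missing numeric values with the column median. It uses plain Python rather than AWS Glue, and the `clean_records` helper, the sample records, and the `price` field are all hypothetical, chosen only for the example.

```python
from statistics import median


def clean_records(records, numeric_field):
    """Drop exact duplicate rows, then fill missing values in one numeric field."""
    seen = set()
    unique = []
    for row in records:
        # Build a hashable fingerprint of the row to detect exact duplicates.
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(dict(row))  # copy so the input is left untouched

    observed = [r[numeric_field] for r in unique if r[numeric_field] is not None]
    if not observed:
        return unique  # nothing to impute from

    fill_value = median(observed)
    for r in unique:
        if r[numeric_field] is None:
            r[numeric_field] = fill_value
    return unique


# Hypothetical sample data: one duplicate row and one missing price.
raw = [
    {"id": 1, "price": 10.0},
    {"id": 1, "price": 10.0},
    {"id": 2, "price": None},
    {"id": 3, "price": 30.0},
]
cleaned = clean_records(raw, "price")
```

Median imputation is only one possible policy; dropping incomplete rows or flagging them for review may be more appropriate depending on the dataset.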
Module 4: Querying and Exploration with Amazon Athena
- Running SQL queries on data stored in S3
- Creating tables and partitions
- Aggregations, filtering, and statistical summaries
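Partitioned tables in Athena are commonly backed by Hive-style `key=value` prefixes in S3. The sketch below shows one way such object keys might be built; the `partitioned_key` helper, the `sales` prefix, and the file name are assumptions made for illustration, not part of the course material.

```python
from datetime import date


def partitioned_key(prefix, day, filename):
    """Build a Hive-style partitioned object key (year=/month=/day=)."""
    return (
        f"{prefix}/year={day.year}/month={day.month:02d}/"
        f"day={day.day:02d}/{filename}"
    )


# Hypothetical prefix and file name.
key = partitioned_key("sales", date(2024, 3, 7), "part-0000.parquet")
```

With a layout like this, a query that filters on `year`, `month`, or `day` can skip unrelated prefixes, which typically reduces the amount of data scanned.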
Module 5: Exploratory Analysis with Amazon SageMaker Studio
- Introduction to SageMaker Notebooks
- Data exploration with Pandas and Matplotlib
- Statistical and correlation analysis
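To give a sense of the statistical and correlation analysis covered here, the following minimal sketch computes summary statistics and a Pearson correlation coefficient using only the Python standard library. In a notebook this would more likely be done with Pandas; the `pearson` helper and the `ad_spend` and `revenue` values are made up for the example.

```python
from math import sqrt
from statistics import mean, stdev


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical data with a perfectly linear relationship.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
revenue = [2.0, 4.0, 6.0, 8.0, 10.0]

summary = {"mean": mean(revenue), "stdev": stdev(revenue)}
r = pearson(ad_spend, revenue)  # close to 1.0 for this sample
```

A coefficient near 1 or -1 suggests a strong linear relationship, while values near 0 suggest little linear association; it says nothing about causation.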
Module 6: Visualization and Insight Generation
- Visualizing data with Amazon QuickSight
- Building interactive dashboards
- Interpreting patterns and anomalies
Module 7: Automating EDA Workflows
- Integrating EDA with AWS Lambda and Step Functions
- Using event-driven workflows for data updates
- Automating summary reports and dashboards
Module 8: Best Practices and Cost Optimization
- Efficient data querying and storage practices
- Security, access control, and IAM roles
- Managing costs during EDA processes