Curso Data Science

  • Data Science Analytic

Curso Data Science

40 horas Curso Pratico
Visão Geral

O Curso Data Science oferece um pacote completo para se tornar um cientista de dados experiente. O treinamento se concentra em fornecer a você um conhecimento aprofundado de ciência de dados desde seu uso e aplicações, computação estatística R, manipulação de dados, visualização de dados, aplicação de estatísticas descritivas e inferenciais nos dados e muito mais. Além disso, você também adquirirá conhecimento prático executando projetos em tempo real e fornecendo soluções para os problemas. Resumindo, este treinamento online em Ciência de Dados deixa você totalmente confiante para enfrentar entrevistas e trabalhar como Cientista de Dados.

Objetivo

Após concluir este Curso Data Science, os alunos estarão em condições de dominar as seguintes áreas:

  • Funções e responsabilidades de um Cientista de Dados
  • Algoritmos de aprendizado de máquina
  • Integrando R com o ecossistema Hadoop
  • Regressão linear e logística
  • Clustering, segmentação de análise e previsão
  • Implantando sistemas de recomendação
  • Interpretação de dados, técnicas de plotagem e métodos de amostragem
Publico Alvo
  • Profissionais de Big Data, Analistas e Business Intelligence
  • Profissionais de aprendizado de máquina
  • Análise Preditiva e Arquitetos de Informação
  • Estatísticos de Big Data
  • Indivíduos que buscam uma carreira em Ciência de Dados
Materiais
Inglês/Português/Lab Prático
Conteúdo Programatico

Data Science Basics

In this module, you will learn the basics of data science and R programming, Importance of Data Science.

Topics covered in this section are: 

  • What is Data Science
  • Significance of Data Science in today’s world.
  • R Programming basics

Learning Outcomes: By the end of this module, you will get a fundamental idea about data science and R programming.

Python Fundamentals

This python fundamentals module discusses the python concepts required for a data scientist. 

Topics covered in this section are: 

  • Python Introduction
  • Indentations in Python
  • Python data types and operators
  • Python Functions

Learning Outcomes: By the end of this module, you will get the basic Python programming knowledge.

Data Structures and Data Manipulation

This module deals with the basic concepts of data structures and data visualization.

Topics covered in this section are: 

  • Data Structures Overview
  • Identifying the Data Structures
  • Allocating values to the Data Structures
  • Data Manipulation Significance
  • Dplyr Package and performing different data manipulation operations.

Learning Outcomes: Upon completing this module, you will be able to understand the significance of Data structures and Data manipulation in Data science.

Data visualization

This module discusses topics like Data visualization, types of graphs, Ggplot2 package, bar plots creation, Univariant, and Multivariant analysis.

Topics covered in this section are: 

  • Introduction to Data Visualisation
  • Various kinds of graphs, Graphics grammar
  • Ggplot2 package
  • Multivariant analysis by using geom_boxplot
  • Univariant analysis by using the histogram, barplot, multivariate distribution, and density plot.
  • Creating the bar plots for the categorical variables through geop_bar() and including the themes through the theme() layer.

Learning Outcomes: At the end of this module, you will be able to visualize the data through different graphs, Ggplot2 package. Also, you will get a real-time experience of bar plot creation, Univariant, and Multivariant analysis. 

Statistics

This Data Science online classroom training module deals with statistics concepts like Classification, Probability Types, Covariance, and Correlation. Along with this, you will learn how to analyze the given data set through Data Sampling, Hypothesis Test, and Binary Distribution.

Topics covered in this section are: 

  • Statistics Importance
  • Statistics classification, Statistical terminology.
  • Data types, Probability types, measures of speed, and central tendency.
  • Covariance and Correlation, Binary and Normal distribution
  •  Data Sampling, Confidence, and Significance levels.
  • Hypothesis Test and Parametric testing

Learning Outcomes: By the end of this module, you will gain practical knowledge of different statistical concepts like Probability types, Hypothesis test, Covariance. You will also be able to work with other statistics techniques like Correlation, Data sampling, Normal and Binary Distribution.

Introduction to Machine Learning

This Machine learning module discusses machine learning basics like Supervise learning, classification, linear regression, and ensemble learning techniques.

Topics covered in this section are: 

  • Machine Learning Fundamentals
  • Supervised Learning, Classification in Supervised Learning
  • Linear Regression and mathematical concepts related to linear regression
  • Classification Algorithms, Ensemble Learning techniques

Learning Outcomes: Upon completing this module, you will get a basic knowledge of machine learning, and you will be proficient in Supervised learning, Linear regression, and Ensemble learning. 

Logistic Regression

In this module, you will learn concepts like logistic regression basics, Bivariate and Multivariate Logistic regression, Poisson Regression. Also, it discusses developing logistic models and logistic regression applications.

Topics covered in this section are: 

  • Logistic Regression Introduction
  • Logistic vs Linear Regression, Poisson Regression
  • Bivariate Logistic Regression, math related to logistic regression
  • Multivariate Logistic Regression, Building Logistic Models
  • False and true positive rate, Real-time applications of Logistic Regression

Learning Outcomes: At the end of this module, you will get practical knowledge of Logistic regression, Linear Regression, Poisson Regression, and Logistic models.

Random Forest and Decision Trees

This module discusses topics like classification techniques, implementing random forest, Naive Bayes, Entropy, Information Gain, and Gini Index.

Topics covered in this section are: 

  • Classification Techniques. Decision Tree Induction Algorithm
  • Implementation of Random Forest in R
  • Differences between classification tree and regression tree
  • Naive Bayes, SVM
  • Entropy, Gini Index, Information Gain

Learning Outcomes: Upon completing this module, you will acquire an in-depth understanding of decision tree induction algorithms, implementing the random forest in the R programming.

Unsupervised learning

This module provides a detailed overview of different clustering types, K-means clustering algorithm, K-means clustering concepts, and implementing historical clustering and PCA in R programming.

Topics covered in this section are: 

  • Clustering, K-means clustering, Canopy Clustering, and Hierarchical Clustering
  • Unsupervised learning, Clustering algorithm, K-means clustering algorithm
  • K-means theoretical concepts, k-means process flow, and K-means implementation.
  • Implementing Historical Clustering in R
  • PCA(Principal Component Analysis) Implementation in R

Learning Outcomes: Upon completing this module, you will get a real-time experience of k-means clustering, clustering algorithm, and Principal Component Analysis.

Natural Language Processing

This data science online training module will help you master natural language processing,  text mining, and NPL working with text mining.

Topics covered in this section are: 

  • Natural language processing and Text mining basics
  • Significance and use-cases of text mining
  • NPL working with text mining, Language Toolkit(NLTK)
  • Text Mining: pre-processing, text-classification and cleaning

Learning Outcomes: At the end of this module, you will get a working knowledge of Natural Language Processing and Text Mining.

Mathematics for Data Science

This module discusses mathematical concepts like Probability basics, Bayes theorem, Numpy Mathematical functions, Conditional probability, and Joint probabilities.

Topics covered in this section are: 

  • Numpy Basics
  • Numpy Mathematical Functions
  • Probability Basics and Notation
  • Correlation and Regression
  • Joint Probabilities
  • Bayes Theorem
  • Conditional Probability, sum rule, and product rule

Learning Outcomes: By the end of this module, you will be able to use probability concepts, Numpy functions,  Bayes theorem, Correlation, and Regression in Data Science

Scientific Computing through Scipy

In this module, you will learn how to perform scientific computing through the Scipy library.

Topics covered in this section are: 

Scipy Introduction and characteristics
Scipy sub-packages like Integrate, Cluster, Signal, Fftpack, and Bayes Theorem

Learning Outcomes: By the end of this module, you will get a real-time scientific computing experience.

Python Integration with Spark

In this module, you will learn the basics, importance, installation, advantages, and applications of Pyspark.

Topics covered in this section are: 

  • Pyspark basics
  • Uses and Need of pyspark
  • Pyspark installation
  • Advantages of pyspark over MapReduce
  • Pyspark applications

Learning Outcomes: At the end of this module, you will acquire practical knowledge of Pyspark.

Deep Learning and Artificial Intelligence

In this module, you will learn the concepts like Deep learning basics, supervised learning, neural networks basics, deep neural networks, convolutional neural networks, recurrent neural networks, and Deep Learning Graphical Processing Unit(GPU).

Topics covered in this section are: 

  •  Machine Learning effect on Artificial Intelligence
  • Deep Learning Basics, Working of Deep Learning
  • Regression and Classification in the Supervised Learning
  • Association and Clustering in unsupervised learning
  • Basics of Artificial Intelligence and Neural Networks
  • Supervised Learning in Neural Networks, multi-layer network
  • Deep Neural Networks, Convolutional Neural Networks
  • Reinforcement Learning, dnn optimisation algorithms
  • Recurrent Neural Networks, Deep learning graphics processing unit
  • Deep Learning Applications, Time series modeling

Learning Outcomes: By the end of this module, you will be able to master the deep learning and artificial intelligence concepts required for a data scientist

Keras and TensorFlow API

This module teaches you how to use TensorFlow and Keras APIs to develop and deploy machine learning and deep learning models.

Topics covered in this section are: 

  • Tensorflow Basics and Tensorflow open-source libraries
  • Deep Learning Models and Tensor Processing Unit(TPU)
  • Graph Visualisation, keras
  • Keras neural-network 
  • Define and Composing multi-complex output models through Keras
  • Batch normalization, Functional and Sequential composition
  • Implementing Keras with tensorboard, customizing neural network training process
  • Implementing neural networks through TensorFlow API

Learning Outcomes: Upon completing this module, you will be able to build deep learning models and visualize the data through Keras and TensorFlow API.

Restricted Boltzmann Machine and Autoencoders

In this module, you will learn how to use restricted Boltzmann machines and autoencoders in deep learning.

Topics covered in this section are: 

  • Basics of Autoencoders and rbm
  • Implementing RBM for the deep neural networks
  • Autoencoders features and applications

Learning Outcomes: At the end of this module, you will achieve hands-on knowledge of Restricted Boltzmann machines and Autoencoders

Big Data Hadoop and Spark

This module allows you to master the concepts of Hadoop, MapReduce, Hive, Kafka, Scala, Spark, Kafka, Spark Streaming, and Dstreams.

Topics covered in this section are: 

  • Big Data and Hadoop Basics
  • Hadoop Architecture, HDFS
  • MapReduce Framework and Pig
  • Hive and HBase
  • Basics of Scala and Functional Programming
  • Kafka basics, Kafka Architecture, Kafka cluster and Integrating Kafka with Flume
  • Introduction to Spark
  • Spark RDD Operations, writing spark programs.
  • Spark Transformations, Spark streaming introduction
  • Spark streaming Architecture, Spark Streaming Features
  • Structured streaming Architecture, Dstreams, and Spark Graphx

Learning Outcomes: By the end of this module, you will acquire real-time experience of working with HDFS, MapReduce framework, HBase, and Kafka. You will also achieve extensive knowledge of developing Spark programs and performing Spark transformations and Spark RDD operations.

Tableau

This Tableau module deals with Data Visualisation concepts, Tableau Installation, Tableau Architecture, sets creation, Tableau Dashboards, Stories, Graphs, and Charts. Along with this, you will also learn expressions, data blending, and tableau prep.

Topics covered in this section are: 

  • Data Visualisation Basics
  • Data Visualisation Applications
  • Tableau Installation and Interface
  • Tableau Data Types, Data Preparation
  • Tableau Architecture
  • Getting Started with Tableau
  • Creating sets, Metadata and Data Blending.
  • Arranging visual and data analytics
  • Mapping, Expressions, and Calculations
  • Parameters and Tableau prep
  • Stories, Dashboards, and Filters
  • Graphs, charts
  • Integrating Tableau with Hadoop and R

Learning Outcomes: By the end of this module, you will get a real-time experience of Creating sets, graphs, charts, dashboards for analyzing data. You will also acquire hands-on knowledge of tableau architecture, tableau installation, tableau prep, and integrating Tableau with R and Hadoop.

MongoDB

This MongoDB module will help you master the concepts like MongoDB basics, MongoDB installation, CRUD operations, Data Indexing, Data Modeling, and Data Administration. Along with this, you will also learn Data Aggregation Schema and Security concepts.

Topics covered in this section are: 

  • MongoDB and NoSQL Basics
  • MongoDB Installation
  • Significance of NoSQL
  • CRUD Operations
  • Data Modeling and Management
  • Data Indexing and Administration
  • Data Aggregation Schema 
  • MongoDB Security
  • Collaborating with Unstructured Data

Learning Outcomes: At the end of this module, you will get hands-on knowledge of using MongoDB for performing different database operations like creating a database, inserting data into a database, deleting and updating the data. You will also be able to master data modeling, data Indexing, and data administration.

SAS

This module deals with the SAS analytic concepts like functions, operators, data sets creation, procedures, graphs, and macros. You will also learn some advanced concepts of SAS.

Topics covered in this section are: 

  • SAS Basics
  • SAS Enterprise Guide
  • SAS functions and Operators
  • SAS Data Sets compilation and creation
  • SAS Procedures
  • SAS Graphs
  • SAS Macros
  • PROC SQL
  • Advance SAS

Learning Outcomes: By the end of this module, you will be able to carry out advanced data analysis by using SAS concepts.

MS Excel

This data science online classroom training module deals with excel concepts like conditional formatting, data filtering, pivot tables, logical functions, and creating charts.  Along with this, you will also learn how to use VBA concepts for data analysis. 

Topics covered in this section are: 

  • Entering Data
  • Logical Functions
  • Conditional Formatting
  • Validation, Excel formulas
  • Data sorting, Data Filtering, Pivot Tables
  • Creating charts, Charting techniques
  • File and Data security in excel
  • VBA macros, VBA IF condition, and VBA loops
  • VBA IF condition, For loop
  • VBA Debugging and Messaging

Learning Outcomes: At the end of this module, you will acquire a working knowledge of excel and VBA.

TENHO INTERESSE

Cursos Relacionados

Curso Fundamentos de Gerenciamento de Dados Mestres

16 horas

Curso Big Data Analyst Mineração de Dados

32 horas

Curso Técnicas de integração de dados ETL

16 horas

Curso Big Data Boot Camp Visão de Negócios

Curso Inteligência Artificial / AI Visão Geral

8 horas

Curso Oracle Fundamentos de Big Data

32 horas

Curso Fundamentos de Qualidade de Dados

16 horas

Curso Marchine Learning Com Hadoop

32 horas

Curso Python for Data Analysis

24 horas