x
Menu

Analyzing Data with Python

,, --- , Prof. Joseph Santarcangelo 0.0 ( Reviews) 23913 Students Enrolled

FVL is learner-supported. When you buy through links on our site, we may earn an affiliate commission

Updated On 02 Feb, 19

Overview

In this course, you will learn how to analyze data in Python using multi-dimensional arrays in numpy, manipulate DataFrames in pandas, use SciPy library of mathematical routines, and perform machine learning using scikit-learn!

Course Description

LEARN TO ANALYZE DATA WITH PYTHON

Learn how to analyze data using Python. This course will take you from the basics of Python to exploring many different types of data. You will learn how to prepare data for analysis, perform simple statistical analyses, create meaningful data visualizations, predict future trends from data, and more!

What you will learn

You will learn how to:



  • Import data sets

  • Clean and prepare data for analysis

  • Manipulate pandas DataFrame

  • Summarize data

  • Build machine learning models using scikit-learn

  • Build data pipelines

  • Data Analysis with Python is delivered through lecture, hands-on labs, and assignment.



It includes following parts:

Data Analysis libraries: will learn to use Pandas DataFrames, Numpy multi-dimentional arrays, and SciPy libraries to work with a various datasets. We will introduce you to pandas, an open-source library, and we will use it to load, manipulate, analyze, and visualize cool datasets. Then we will introduce you to another open-source library, scikit-learn, and we will use some of its machine learning algorithms to build smart models and make cool predictions.

Pre-Requesities

Some Python Experience

Syllabus

COURSE SYLLABUS

Module 1 - Importing Datasets


  • Learning Objectives

  • Understanding the Domain

  • Understanding the Dataset

  • Python package for data science

  • Importing and Exporting Data in Python

  • Basic Insights from Datasets


Module 2 - Cleaning and Preparing the Data


  • Identify and Handle Missing Values

  • Data Formatting

  • Data Normalization Sets

  • Binning

  • Indicator variables


Module 3 - Summarizing the Data Frame


  • Descriptive Statistics

  • Basic of Grouping

  • ANOVA

  • Correlation

  • More on Correlation


Module 4 - Model Development


  • Simple and Multiple Linear Regression

  • Model Evaluation Using Visualization

  • Polynomial Regression and Pipelines

  • R-squared and MSE for In-Sample Evaluation

  • Prediction and Decision Making


Module 5 - Model Evaluation


  • Model  Evaluation

  • Over-fitting, Under-fitting and Model Selection

  • Ridge Regression

  • Grid Search

  • Model Refinement

Ratings

0.0


Ratings
55%
30%
10%
3%
2%
Comments
comment person image

Sam

Sed sollicitudin risus eget nisl accumsan, nec gravida metus fringilla accumsan magna a lorem auctor sagittis.

Reply
comment person image

Dembe

Etiam volutpat, orci quis vulputate sodales, metus diam scelerisque ligula, sit amet conggaugue orci ut leo. Sed mattis suscipit urna sed finibus.

Reply
Send
x