Data Science in Python: Pandas, Scikit-learn, NumPy, Matplotlib
Python, NumPy, Pandas, Matplotlib, web scraping, preprocessing, data cleaning, machine learning, PySpark, statistics
What you'll learn
- Install Anaconda and set up a Python environment
- Python crash course
- NumPy - Numerical Python
- Pandas - Data analysis library
- Matplotlib - Data visualization library
- Plotly
- Data pre-processing techniques - Missing data, normalization, one-hot encoding
- Importing data into Python from different sources and files
- Web scraping to download web pages and extract data
- Data scaling and transformation
- Exploratory data analysis
- Feature engineering
- Basic machine learning theory
- Apache Spark installation: PySpark
- Getting started with a Spark session
- Spark Hello World
- Statistics basics
- Basics of Probability
- Set up a Data Science Virtual Machine on Microsoft Azure Cloud
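
As a small taste of the pre-processing topics in the list above (missing data, normalization, one-hot encoding), here is a minimal sketch using Pandas. It is not taken from the course material: the toy table, the column names, and the choice of mean imputation and min-max scaling are assumptions made for illustration only.

```python
import pandas as pd

# Hypothetical toy data; column names and values are made up for illustration.
df = pd.DataFrame({
    "age": [20.0, None, 40.0],
    "city": ["Pune", "Delhi", "Pune"],
})

# Missing data: fill the gap in "age" with the column mean (one of several options).
df["age"] = df["age"].fillna(df["age"].mean())

# Normalization: min-max scale "age" into the range [0, 1].
age_min, age_max = df["age"].min(), df["age"].max()
df["age_scaled"] = (df["age"] - age_min) / (age_max - age_min)

# One-hot encoding: one 0/1 indicator column per distinct city.
encoded = pd.get_dummies(df, columns=["city"], dtype=int)
print(encoded)
```

Mean imputation and min-max scaling are only two of the possible choices here; median imputation or standardization may suit other data better.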
Author: Ankit Mistry