Gunnika Batra Google Code-in Mentor @Tensorflow | Lead @WTMBVP | GitHub Campus Expert | Former Data Analytics Intern @SHEROES | Python and food enthusiast

Data Science Bootcamp – Day #15 – Distribution, Skewness and Data Cleaning

Hello learners!

Let’s broaden our statistics knowledge and learn about Distribution and Skewness today. There are many distributions in statistics, but we’ll focus on the Normal Distribution, since many statistical models assume it.

Data skewness is one of the most common challenges data scientists face in real-world case studies. We’ll figure out what positive and negative skewness mean in statistics.
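To get a feel for the sign of skewness before opening the module, here is a minimal sketch using only the Python standard library. The helper `sample_skewness` and the generated samples are made up for illustration and are not taken from the bootcamp notebook. It computes the moment-based skewness coefficient (third central moment divided by the standard deviation cubed): a long right tail gives a positive value, a long left tail gives a negative value, and symmetric data sits near zero.

```python
import random
import statistics


def sample_skewness(values):
    """Moment-based skewness: third central moment / (second central moment ** 1.5)."""
    n = len(values)
    mean = statistics.fmean(values)
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment (variance)
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical samples, generated with a fixed seed so results are repeatable.
rng = random.Random(0)
symmetric = [rng.gauss(0, 1) for _ in range(10_000)]          # normal: roughly symmetric
right_skewed = [rng.expovariate(1.0) for _ in range(10_000)]  # long tail on the right
left_skewed = [-x for x in right_skewed]                      # mirror image: long tail on the left

for name, data in [("symmetric", symmetric),
                   ("right-skewed", right_skewed),
                   ("left-skewed", left_skewed)]:
    print(f"{name:>12}: skewness = {sample_skewness(data):+.2f}")
```

Libraries such as pandas and SciPy ship their own skewness functions, some of which apply a small-sample bias correction, so their numbers can differ slightly from this plain version.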

We have a well-documented notebook for performing Exploratory Data Analysis on the Wine Dataset. With the help of various graphs, you’ll figure out which of the two wines, Red or White, has the better quality.

Apart from these, we have a great blog post that walks through all the steps of Data Cleaning using the Russian Housing Dataset. It covers everything we’ve learnt so far. You’ll analyse and visualise data, detect outliers, remove irrelevant and inconsistent values, and end up with structured, clean data.
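As a small taste of the outlier-detection step, here is a sketch of one common rule of thumb, the 1.5 × IQR rule. This is an assumption chosen for illustration; the linked blog may use a different method. The function name `iqr_outliers` and the `prices` list are hypothetical and do not come from the Russian Housing Dataset.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # three cut points: Q1, median, Q3
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in values if x < low or x > high]


# Hypothetical prices with one obviously inconsistent entry.
prices = [52, 55, 57, 58, 60, 61, 63, 64, 66, 400]

outliers = iqr_outliers(prices)
cleaned = [x for x in prices if x not in outliers]

print("outliers:", outliers)
print("cleaned: ", cleaned)
```

Whether a flagged value should be dropped, capped, or kept is a judgment call that depends on the data, so treat the rule as a way to find candidates rather than an automatic filter.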

Find the module below:


Happy learning!
Gunnika
Team DPhi
