Skip main navigation

New offer! Get 30% off your first 2 months of Unlimited Monthly. Start your subscription for just £35.99 £24.99. New subscribers only T&Cs apply

Find out more

Introduction to uncertainty and errors in data

Learn about uncertainty and errors in data
(dramatic music) In this video, we will explore uncertainty and the reasons behind is cause in data. Let us refresh a concept from one of our previous sections, error bars in Excel. In some data sets each data point has a particular error amount associated with it. Perhaps in another column in a spreadsheet. In Excel and other tools this value can be referenced to render the error bars. Error bars can be added to a chart in Excel in the same way as many other chart elements by selecting add chart element, then error bars. There are many reasons why we might have uncertainty in our data. As an analyst, you are naturally expected to account for it while conducting your analysis.
Let us try to understand this with this scenario here. Let us assume that a weighing machine is certified for a particular accuracy and gives the correct weight every time a person measures their weight. Now if we take samples from a population who use this weighing machine, the mean of the population will be the same as the mean of the mean weight of multiple samples.
Let us assume that the measurement has some kind of error to it and this machine weighs the person as 60 plus or minus one kilogrammes. In such a case, if we take samples from a population who use this weighing machine, we know the mean of these samples probably won’t match the mean or other measuring attributes of the population. This results in uncertainties in your data and can be measured as standard deviation, standard error of the mean, and confidence interval. Depending on the size of the sample compared to the size of the population, we may have more or less error or confidence. Throughout this section, you will learn to create error bars in Matplotlib using Python.
You will also learn to display discrete data using the confidence bands. Lastly, we will see what bootstrapping is and how effective the method is. Let’s get started.

Before we start learning how to display uncertainty using Python, watch this video to:

  • familiarise yourself with the constant changes in errors and uncertainty
  • understand the various methods of preparing your plots and choosing an appropriate plot type according to the data set.

Download your Jupyter Notebook

Before you start with this week, download your accompanying Jupyter Notebook containing explanations and codes in cells that you can run to receive outputs.

The Notebook contains only the code snippets that you can run to get an immersive and interactive experience, as well as instant results of the codes alongside the explanations.

Make best use of this opportunity to familiarise yourself with using the Notebook.

Download: Showing uncertainty

This article is from the free online

Data Visualisation with Python: Seaborn and Scatter Plots

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now