Skip main navigation

Introduction to week 2

A short introductory article to week 2 of experimental design for machine learning - Understanding and working with data
Welcome to week 2 of the course.

Last week we focused on data acquisition and annotation, the experimental phase of the project. This week we will look in more detail at some important things to think about once your datasets have been obtained, in order to prepare them for use in machine learning models, specifically organising your data, supplementing your data, and sharing your data.

The week has been divided into the following activities:

  • organising your datasets:
    • useful filenames
    • labelling within images e.g. QR codes
    • splitting datasets into subsets
  • expanding your dataset
    • data augmentation
    • synthetic data
    • using pre-existing datasets
  • releasing data – why might everyone benefit from you publicly sharing your datasets?

We will begin in the next step with a video which focuses on understanding and working with data.

Week 2 learning outcome:

  • Improve and organise datasets collected experimentally for use in machine learning and deep learning models

This article is from the free online

Experimental Design for Machine Learning

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now