Skip main navigation

New offer! Get 30% off one whole year of Unlimited learning. Subscribe for just £249.99 £174.99. New subscribers only. T&Cs apply

Find out more

Week 2 review

A summary of Week 2 of Experimental design for machine learning, covering organisation, expansion and release of datasets.
That’s the end of Week 2 of the course.

We hope that it has given you some useful ideas as to the best ways to organise and maximise the use of your datasets.

Specifically we looked at the following:

  • organising your datasets
    • sensible filenames – unique and descriptive names can help to organise your data
    • use of QR codes and barcodes as labels within images
    • scales within images – for relating pixels to actual size
    • division of datasets into training, valdation and test sets
  • expanding your datasets
    • augmentation – making slightly altered copies of existing images
    • synthetic data – creating entirely new, artificial images to train models
    • pre-existing datasets – other studies may have used similar images with which you can partially train your model
  • releasing data
    • types of license – e.g. Creative Commons
    • reasons for sharing data.

What’s next?

In the first two weeks of the course we have focused almost entirely on the data aspect of machine learning. In Week 1 we talked about acquiring and annotating data, while in Week 2 we looked at ways in which data can be organised and supplemented.

Next week, we will begin looking in more detail at what we can do with our data using machine learning and deep learning models.

This article is from the free online

Experimental Design for Machine Learning

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now