
Introduction to linear regression

Linear regression is one of the basic supervised learning methods.
In this video we explain the basic idea of `least squares`.
In this video lecture we explain the main characteristics of linear regression and how to perform it on large data sets using RHadoop. Let us first continue with the song complexity example. Recall that we have a data set of songs, and for each song we know its complexity as measured by UNIGRAM and BIGRAM entropy. Based on the scatter plot, we can conclude that there is a relatively strong linear relation between these two variables. We therefore look for two parameters, beta zero and beta one, that define the line which best fits the dots in the plot.
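Written as a formula, the line we are looking for has an intercept (beta zero) and a slope (beta one). The transcript does not say which entropy plays the role of the predictor, so the choice below (x for UNIGRAM entropy, y for BIGRAM entropy) is only an assumption made for illustration:

```latex
\hat{y} = \beta_0 + \beta_1 x
```

Here \hat{y} is the value the line predicts for a song whose predictor value is x.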
The criterion for finding the best line is the sum of squares of the vertical residuals, which we want to make as small as possible. The residuals are the vertical distances between the data points and their projections onto the line along the y axis. Note that we can also consider non-linear regression. For example, the goal of quadratic regression is to find the parabola that best fits the data points. The criterion might again be the sum of squares of the vertical residuals. Likewise, we can define exponential regression.
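The least squares idea for a straight line can be sketched in a few lines of code. This is only an in-memory illustration in plain Python, not the RHadoop approach used in the course. The function names `fit_line` and `sum_of_squares` are hypothetical, and the data are synthetic stand-ins, not the songs data set.

```python
import random


def fit_line(xs, ys):
    """Return (beta0, beta1) minimising the sum of squared vertical residuals."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Closed-form least squares solution for a single predictor.
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    beta1 = sxy / sxx
    beta0 = y_mean - beta1 * x_mean
    return beta0, beta1


def sum_of_squares(xs, ys, beta0, beta1):
    """Sum of squared vertical residuals of the line beta0 + beta1 * x."""
    return sum((y - (beta0 + beta1 * x)) ** 2 for x, y in zip(xs, ys))


# Synthetic data (an assumption for illustration): points scattered
# around the line y = 1.0 + 1.5 * x.
random.seed(0)
xs = [random.uniform(5.0, 9.0) for _ in range(200)]
ys = [1.0 + 1.5 * x + random.gauss(0.0, 0.3) for x in xs]

beta0, beta1 = fit_line(xs, ys)
best = sum_of_squares(xs, ys, beta0, beta1)
shifted = sum_of_squares(xs, ys, beta0 + 0.1, beta1)

print("beta0:", beta0, "beta1:", beta1)
print("criterion at the fitted line:", best)
print("criterion after shifting the line up by 0.1:", shifted)
```

Any other line, such as the shifted one above, gives a larger value of the criterion. Quadratic regression uses the same criterion with an extra squared term in the prediction, which this sketch does not cover.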
This article is from the free online course Managing Big Data with R and Hadoop.

Created by
FutureLearn - Learning For Life
