Skip main navigation

Topic 4: Distribution of student heights

Topic 4: Distribution of student heights
A dataset comprising measurements of height (in inches) of 262 female and 117 male students at the University of Georgia was considered in Example of the reading Numerical summaries of data.

A joint distribution of this data is bimodal, clearly separating into two subsets for females and males.

With one observation omitted as an outlier (92 inches for a female student), a subset of female heights was analysed and it was demonstrated that a normal approximation is a good fit to the histogram.

From the ‘Downloads‘ section in Step 2.13, download the zip folder and upload the .csv file dataset, into RStudio to analyse and answer the following questions:

  • Plot a joint histogram of heights and check the normal approximation.
  • Repeat the analysis for the subset of male heights: construct a histogram, impose a normal approximation and check the accuracy of the Empirical Rule.
  • How could you combine the normal approximation for the marginal datasets (i.e for females and males)? Suggest a reasonable smooth approximation for the joint dataset and illustrate it graphically.

This article is from the free online

Statistical Methods

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now