£199.99 £139.99 for one year of Unlimited learning. Offer ends on 14 November 2022 at 23:59 (UTC). T&Cs apply

Find out more
Practical activity: Qualitative statistics using Python libraries
Skip main navigation

Practical activity: Qualitative statistics using Python libraries

Perform qualitative statistics using Python libraries.

Dataset

We are going to use the same set of data that we used in the previous module. Here is a recap of the introduction to the scenario and the dataset:

Scenario

With the alarming rise in the number of forest fires recorded in Brazil alone, authorities and people in the country, and across the world, are worried. Hence, understanding the frequency of the forest fires and finding other patterns using data analysis can help with taking proactive actions to prevent them.

In the exercise, you are exclusively going to use the Forest Fire Data that has been made available by Brazilian Forest Departments. [1]

For this exercise, the data has been pre-formatted as comma-separated values and translated into English where applicable.

The data set includes the following information:

  • Year: when the fire occurred

  • State: where the fire was reported

  • Month: when the fire occurred

  • Number of fires: frequency reported

  • Date reported: when the fire was reported.

Download the CSV file of the dataset and get started with the activities in hand at the end of each module on the already provided Jupyter Notebooks.

Download: Brazilian-fire-dataset.csv

Download the file and save it under the file path as shown below:

dataset/brazilianfire.csv

Exercise

In this section, you will be writing the Python code to perform certain activities as instructed in the notebook.

Download: Quantitative-Statistics-using-NumPy-and-Pandas-Exercise

You will see that there are cells with the heading TO DO – Activity. You are required to write the code in the code cells after the activity markdown cells.

Solution

Once you finish adding your codes and running them on the notebook, download the solution notebook to compare and contrast with the expert copy of the notebook provided to you.

Download: Quantitative-Statistics-using-NumPy-and-Pandas-solution.ipynb

All the best for this exercise!

Share your experience with this exercise in the comments and let others know how you plan to use your learning from this to frame your data pipeline for the final assessment.

This article is from the free online

Python Packages: NumPy and Pandas Dataframe

Created by
FutureLearn - Learning For Life

Our purpose is to transform access to education.

We offer a diverse selection of courses from leading universities and cultural institutions from around the world. These are delivered one step at a time, and are accessible on mobile, tablet and desktop, so you can fit learning around your life.

We believe learning should be an enjoyable, social experience, so our courses offer the opportunity to discuss what you’re learning with others as you go, helping you make fresh discoveries and form new ideas.
You can unlock new opportunities with unlimited access to hundreds of online short courses for a year by subscribing to our Unlimited package. Build your knowledge with top universities and organisations.

Learn more about how FutureLearn is transforming access to education