Skip main navigation

CRISP-DM and Data Wrangling

CRISP-DM and data wrangling

Once the data has been loaded and extracted, it then needs to be cleaned, transformed, and rearranged. This process is known as data wrangling. Let’s watch a video to learn more about data wrangling.

CRISP-DM

Do you remember learning about the CRISP-DM process earlier in Course 1?

Graphic shows the "Data analytics CRISP process". We see the six phases. "Business Understanding, "Data Understanding". Data Preparation", "Modeling", "Evaluation", and "Deployment". The diagram shows that the sequence is not strict and can move back and forth with arrows moving in both directions between the phases – this represents the cyclic nature of data mining.

In the previous course, we unpacked data ingestion as a part of the data understanding step of the CRISP-DM process. This week and next, we will have similar practical and hands-on tasks for the next step in the process of data preparation that includes data wrangling and transformation.

Sometimes, the way data is stored in the data sources (files, databases) is not in the format you need for a data processing application, and therefore substantial time is spent on data preparation.

Pandas, along with the various libraries and modules of Python, provide a flexible, high-level, and high-performing set of core manipulations and algorithms that enable you to perform the data wrangling into the required form.

This week, we will spend a lot of time on building the foundations of data wrangling activities that can be performed in Python by way of examples and programming.

This article is from the free online

Data Wrangling and Ingestion using Python

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now