Skip main navigation

GroupBy mechanics

Consider this scenario: You are analysing data for a company that sells computer hardware. The company would like to look at the past year’s sales trends to see which types …

Reshaping data sets

Python has operations for rearranging tabular data, known as reshaping or pivoting operations. For example, hierarchical indexing provides a consistent way to rearrange data in a DataFrame. There are two …

Merge() parameters

Now, let’s explore some examples using various parameters in the merge function. Example 1: Inner joins Look at these dataframes: Click to enlarge Notice that both the left and right …

Merging Data Sets

Instead of simply concatenating two data sets, sometimes you might need to merge them. What’s the difference? A concatenation means combining dataframes as additional rows or columns, regardless of the …

CRISP-DM and Data Wrangling

Once the data has been loaded and extracted, it then needs to be cleaned, transformed, and rearranged. This process is known as data wrangling. Let’s watch a video to learn …

Data manipulation and transformations

We learned joining, merging, and rearranging data, but data analytics often requires many other manipulation operations. For example: bulk transforming records (eg, add missing address information) detecting and filtering outliers …