Skip main navigation

Solutions of the Two Case Studies

I hope you have enjoyed the challenge of the two case studies. You can now download the solution to compare with your own. Remember, this is just one possible solution.

Introduction to Data Analytics libraries

Pandas Pandas is a Python package providing fast, flexible, and expressive data structures designed to work with relational or labelled data. It is a fundamental high-level building block for doing …

Introduction to the Python Jupyter Notebook

The Jupyter Notebook The Jupyter Notebook is an open-source web-based application that you can use to create and share documents that contain live code, equations, visualisations, and text. It is …

Conclusion

Conclusion and Summary Well done! You have completed the Python for Big Data Analytics course! I hope you have enjoyed this course. During this course you have looked at the …

General Python Syntax

Introduction to Python Python is an interpreted computer language, which has become increasingly popular among data scientists. It was designed to emphasise code readability, by using white­space indentation to define …

Design of suitable architecture

Overall Architecture In Data Analytics (and computer science in general), an architecture is the organisation of the different components which are used to create a system. They are usually organised …

Introduction of relevant libraries in Python

Text Pre-Processing Once a text is loaded, the first step to remove any character and provide a uniform input by removing capital letter, digits and any other potential ambiguity. This …

Case Studies Introduction

Case Studies Introduction The first case study includes tweets regarding customers’ experience with airlines. The second case study focuses on the infamous Unite the Right rally, which took place in …

Introduction to Data Analytics libraries

Data Analytics and Natural Language Processing Data analytics is particularly useful when unstructured data is analysed. There are two types of data, structured and unstructured. The former includes any format …

Identification of requirements

Main Requirements The main requirements in these case studies include the use of some specific Python libraries. More specifically NLTK Seaborn NLTK stands for Natural Language Toolkit and it provides …

Introduction

Welcome to Big Data Analytics course. This course is designed to support you in understanding not just the basics of Big Data analytics but how it is so crucial in …

Specific methods relevant to data analytics

The main data analytics methods include Machine Learning Text mining Sentiment analysis Systematic reviews General statistical analysis Visualisation Let me go through them, one by one. Machine learning, as the …

Summary of Week 1

Summary This week, the following topics have been discussed Introduction to Big Data Analytics General introduction to Python Python libraries for Data Analytics The Anaconda environment and the Jupyter Notebook …

Week 2 Summary

Summary This week we have introduced the main steps to build a Data Analytics solution. More specifically Main Data Analytics libraries Identification of the solution requirements Design and assessment of …