Skip main navigation

New offer! Get 30% off one whole year of Unlimited learning. Subscribe for just £249.99 £174.99. T&Cs apply

Overview of different data formats

Sequencing data can be saved in specific formats to make it easier to collect, store, analyse, and disseminate information. Each format will specify the manner in which each level of …

Amplicon-based sequencing

Introduction Amplicon sequencing is a highly targeted approach that enables researchers to analyse genetic variation in specific genomic regions. The ultra-deep sequencing of PCR products (amplicons) allows efficient variant identification …

Introduction to sequencing methods

The genomic era was propelled forward by DNA sequencing techniques pioneered by numerous scientists in the 1970s. Fredrick Sanger developed the “chain-termination method”, now known as the “Sanger method”, in …

SARS-CoV-2 genomic landscape

All of us have been impacted in some way by SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2). You are likely to be familiar with a city in China that you …

How is SARS-CoV-2 sequencing done?

There are many sequencing technologies used for SARS-CoV-2 (Figure 1). In this step, we will approach a broadly used protocol based on viral whole-genome sequencing (WGS). The method consists of …

What are clades, lineages and variants?

Pathogen genomics helps us to track the spread of an outbreak and to identify significant changes in the genome of a pathogen. This, in turn, helps us to identify the …

Variant calling and annotation

Introduction Identifying genomic variants can play an important role in scientific discovery as exemplified by the ongoing SARS-CoV-2 global surveillance. Variant calling involves identifying single nucleotide polymorphisms (SNPs), and small …

Acknowledgements

We would like to thank all those who were involved in the making of the course Making sense of genomic data: COVID-19 web-based bioinformatics Course Educators Carolina Castañeda Garcia PhD …

FastQC and MultiQC tools

FastQC In order to analyse the quality of the raw data generated from next-generation high throughput sequencers, quality control (QC) reports need to be generated. FastQC is a popular tool …

Data cleaning and quality control

Machines may be less error-prone than humans, but even machines can make mistakes. Or at the very least, they are only as good as the person/team who programmed them. The …

Resources

A useful range of COVID-19 and SARS-CoV-2 additional resources. Datasets for this course Zenodo repository: Making sense of genomic data: COVID-19 web-based bioinformatics Resources Coronavirus biology: ViralZone Coronavirus taxonomy: International …

Best practices for presenting your data

Presentations are an important part of the research process and allow the researcher a chance to illustrate the heart of their research to a public audience. The use of graphs …