Learn more about this course.

A primer on genome sequencing

In this article we discuss the key concepts in genome sequencing

The genome of an organism consists of one or more stretches of DNA, which can be thought of as strings of the letters A, T, G and C. A particular string of DNA letters is called a sequence. DNA letters are also known as bases. Because DNA is a double helix, consisting of pairs of bases, the length of a DNA sequence is described as a number of base pairs, or bp for short. The process of determining these sequences is called genome sequencing. The perfect genome sequencing technology would enable us to identify the order of letters in a chromosome all in one go. However, this is not currently possible.

DNA Double Helix and Sequence

Diagram of DNA Double Helix and Sequence Courtesy: National Human Genome Research Institute

Instead we are only able to determine shorter sequences, called reads. The length and number of these reads varies with different technologies. The first technology used to sequence bacterial genomes was Sanger sequencing. Watch this video to see how the technology works. It produces 500bp reads, but is expensive and laborious. Illumina’s sequencing-by-synthesis approach can produce reads of 75-250bp much more cheaply. Watch this video explaining the technology. This technology also produces a large number of reads, perhaps 500 million at a time. The Pacific Biosciences Single-Molecule Real Time (SMRT) technology produces reads of 5000bp-40000bp. The number of reads is smaller – around 1 million. Different technologies are useful for different applications. In particular SMRT sequencing is most useful for producing new genome reference sequences from scratch, whereas sequencing-by-synthesis is most useful for looking at variation between genomes (resequencing).

Want to keep
learning?

This content is taken from
Wellcome Connecting Science online course,

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

View Course

To get a good understanding of what a whole genome looks like we need to start with many copies of the bacterial genome (i.e. lots of bacteria). If we only had one genome, there would be many parts of the genome that would not be captured, because some pieces are lost by chance during preparation for sequencing. There are also usually a small number of errors in each read. Therefore, to be sure of producing an accurate final sequence, we need to sample each part of the genome several times.

Want to keep learning?

This content is taken from Wellcome Connecting Science online course

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

View Course

See other articles from this course

This article is from the free online

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Created by

Join Now

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now

Learn more about this course.

A primer on genome sequencing

Want to keep
learning?

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Want to keep learning?

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Reach your personal and professional goals

Register to receive updates

Learn more about this course.

Learn more about this course.

See all FutureLearn courses.

Learn more about this course.

A primer on genome sequencing

Want to keep learning?

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Want to keep learning?

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Share this

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Bacterial Genomes: Disease Outbreaks and Antimicrobial Resistance

Reach your personal and professional goals

Register to receive updates

Learn more about this course.

Learn more about this course.

See all FutureLearn courses.

Want to keep
learning?