Skip main navigation
We use cookies to give you a better experience, if that’s ok you can close this message and carry on browsing. For more info read our cookies policy.
We use cookies to give you a better experience. Carry on browsing if you're happy with this, or read our cookies policy for more information.

DNA, the code for life

What is the genome?

Increasingly clinicians, scientists and even the media, are talking about the genome and the impact of genomic data, now and in the future, on healthcare.

The genome describes an organism’s complete set of genetic instructions. For the human genome this means the ~ 3.2 billion bases which contain the code for ~ 20 000 genes.

As we discussed previously, DNA (or deoxyribonucleic acid) is a long molecule that contains our unique genetic code. Like a recipe book it holds the instructions for making all the proteins in our bodies. As famously described by Watson and Crick in 1953, DNA usually exists as two coiled chains, like a twisted ladder. This is called the double helix. The rungs of the ladder consist of nucleotides. Nucleotides are composed of a nitrogenous base, a five-carbon sugar (ribose or deoxyribose), and at least one phosphate group. The nitrogenous bases are called adenine (A), guanine (G), thymine (T) and cytosine (C). A will always pair with T and C will always pair with G via hydrogen bonds. In this course, to avoid confusion, we use the term “base” rather than “nucleotide”, although the two terms are often used interchangeably.

Image of Double Helix © YourGenome

In order that cells can develop, grow and differentiate to fulfill particular roles- such as eye cells, muscle cells, blood cells etc- they must generate proteins. The mechanisms whereby proteins are produced will be discussed at greater length in the next section. However, it is important to appreciate (and mind-blowing to realise) that the instructions for building proteins and therefore the basis for complex life as we know it, rests with the simple four base DNA code: because each group of three consecutive bases encodes one of 20 amino acids, the basic building blocks of proteins, different combinations of bases result in differently ordered amino acids and different proteins.

The Genetic Code © NHS National Genetics and Genomics Education Centre

Finally, a key feature of DNA and fundamental to its ability to faithfully replicate itself, encode proteins and the basis to modern techniques to decipher the genome is the fact that the two chains of the double helix will only fit together correctly if the A is opposite T and C is opposite G. One strand of the DNA will therefore act as a template for the synthesis of an exact replica of the opposite strand. As Watson and Crick famously said “It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material….”

Share this article:

This article is from the free online course:

The Genomics Era: the Future of Genetics in Medicine

St George's, University of London

Get a taste of this course

Find out what this course is like by previewing some of the course steps before you join:

  • Welcome to Week 1
    Welcome to Week 1
    video

    In this video, Lead Educator, Dr Kate Tatton-Brown welcomes learners to the course and explains the course aims and outcomes.

  • Did you know?
    Did you know?
    video

    Our resident scientist tells you his favourite genomics facts.

  • Errors in recombination
    Errors in recombination
    article

    This video describes how structural chromosome abnormalities occur when errors occur in recombination.

  • Responsibility in the genomic era
    Responsibility in the genomic era
    video

    In this tutorial, you will hear from Dr Carwyn Rhys Hooper on the concept of responsibility for health.

Contact FutureLearn for Support