
Finding protein function: Functional annotation

Learn how to find out more about the function of the proteins that genes encode, and about the methods used for functional annotation.
Large RuvA protein molecule bound to thin double-stranded DNA (side view)
© Wellcome Genome Campus Advanced Courses and Scientific Conferences

In addition to finding where genes are in the genome, it is possible to find out more about the function of the proteins they encode. Almost universally, sequence similarity tools are used to transfer a function from a protein of known function to an uncharacterised protein, provided the two sequences are similar enough. Using similarity between sequences to infer function is called homology-based annotation.
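The idea of transferring function by similarity can be sketched in a few lines of Python. This is a minimal illustration, not how any particular annotation tool works: it assumes the sequences have already been aligned (gaps written as `-`), uses simple percent identity as the similarity measure, and applies an arbitrary cut-off of 0.4. The function names, the example sequences and the threshold are all hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Fraction of aligned (non-gap) columns where the residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip columns that contain a gap
        compared += 1
        if a == b:
            matches += 1
    return matches / compared if compared else 0.0


def transfer_annotation(alignments, min_identity=0.4):
    """Pick the function of the most similar reference protein.

    alignments: list of (aligned_query, aligned_reference, function) tuples.
    Returns (function, identity); function is None when no reference
    reaches min_identity (an illustrative, arbitrary threshold).
    """
    best_function, best_identity = None, 0.0
    for aligned_query, aligned_ref, function in alignments:
        identity = percent_identity(aligned_query, aligned_ref)
        if identity > best_identity:
            best_function, best_identity = function, identity
    if best_identity < min_identity:
        return None, best_identity
    return best_function, best_identity


# Hypothetical toy alignments of one query against two known proteins.
hits = [
    ("MKTA", "MKSA", "DNA helicase"),
    ("MKTA", "QQQA", "transporter"),
]
print(transfer_annotation(hits))
```

In practice, similarity searches report statistics such as alignment scores and E-values rather than raw identity alone, so the threshold logic here should be read only as a stand-in for "similar enough".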

Initially, coding sequences (CDSs) were marked on the genome semi-automatically and then checked manually to ensure they were ‘true’ genes encoding ‘true’ proteins. Later, as technologies evolved, higher-throughput methods were developed that predicted CDSs from open reading frames. These predictions were then manually curated using tools such as BLASTX, which compare each newly identified CDS with protein databases such as UniProt or the non-redundant (nr) protein database at NCBI.
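To make the notion of predicting open reading frames concrete, here is a small Python sketch. It assumes the simplest possible definition of an ORF (an ATG start codon followed in the same frame by the first stop codon) and scans the three forward frames of a DNA string; the reverse strand can be scanned by passing in the reverse complement. The function names and the `min_codons` parameter are hypothetical, and real gene predictors use considerably more evidence than this.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_orfs(seq: str, min_codons: int = 3):
    """Find ATG...stop ORFs in the three forward reading frames.

    Returns a list of (frame, start, end) tuples with 0-based start and
    exclusive end; the span includes the stop codon, and min_codons
    counts the start and stop codons too.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((frame, start, i + 3))
                start = None  # look for the next ORF in this frame
    return orfs


# Toy sequence: one ORF (ATG AAA TGA) in the third forward frame.
dna = "CCATGAAATGATT"
for frame, start, end in find_orfs(dna):
    print(frame, start, end, dna[start:end])

# Scan the opposite strand by reverse-complementing first.
print(find_orfs(reverse_complement(dna)))
```

Each candidate returned by a scan like this would still need checking against protein databases, which is the curation step described above.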

More modern annotation approaches include online tools such as RAST (Rapid Annotations using Subsystems Technology). As its name suggests, RAST uses a ‘subsystem’, or pathway, approach to annotate bacterial and archaeal genomes. In short, it assigns genes to functional roles and groups related roles into subsystems, such as a metabolic pathway or a transport system. Placing each gene in the context of a subsystem is intended to give a more accurate annotation.
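The grouping step can be pictured with a toy Python example. This is only a sketch of the general idea of collecting functional roles into subsystems; it does not reproduce RAST's data or algorithm. The role names, the subsystem names, the gene identifiers and the `group_by_subsystem` function are all made up for illustration.

```python
from collections import defaultdict

# Hypothetical lookup table from functional role to subsystem.
ROLE_TO_SUBSYSTEM = {
    "Glucokinase": "Glycolysis",
    "Pyruvate kinase": "Glycolysis",
    "ABC transporter permease": "Transport systems",
}


def group_by_subsystem(gene_roles, role_to_subsystem=ROLE_TO_SUBSYSTEM):
    """Group gene identifiers by the subsystem of their functional role.

    gene_roles: dict mapping gene identifier -> functional role.
    Roles missing from the lookup table are placed in "Unclassified".
    """
    subsystems = defaultdict(list)
    for gene, role in gene_roles.items():
        subsystem = role_to_subsystem.get(role, "Unclassified")
        subsystems[subsystem].append(gene)
    return dict(subsystems)


# Hypothetical annotated genes.
genes = {
    "gene1": "Glucokinase",
    "gene2": "Pyruvate kinase",
    "gene3": "ABC transporter permease",
    "gene4": "Hypothetical protein",
}
print(group_by_subsystem(genes))
```

Viewing genes subsystem by subsystem makes gaps easier to spot, for example a pathway in which most but not all expected roles have been assigned.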

In this course, we will use ready-made annotation files from related bacterial pathogens, and we will focus on comparing these to gain more insight into the biology of bacterial genomes.

In 2008, Aziz et al. published the paper describing RAST, The RAST Server: Rapid Annotations using Subsystems Technology (https://bmcgenomics.biomedcentral.com/articles/10.1186/1471-2164-9-75). Follow the link to read more about RAST and its capabilities.

This article is from the free online course Bacterial Genomes II: Accessing and Analysing Microbial Genome Data Using Artemis.
