Skip main navigation

Get 30% off one whole year of Unlimited learning. Subscribe for just £249.99 £174.99. T&Cs apply

Connecting to databases: exercise part 1

exercise with data upload
learner's hand working at a computer keyboard with a screen in front

Check your learning: connecting to databases

In this activity we will download and search the NCBI database list to find the blaKPC gene. You can find out more about this gene in this review article.

Step 1: Navigate to the NCBI database FTP page and download the database overview file

This will bring up a list of different files, each with a different purpose. The one we care about is the ReferenceGeneCatalog.txt file.
  • Right click that file and save it to your computer. If having trouble, you can download it from the bottom of this step.
  • This file is a TSV (tab separated values) file. You should be able to load this directly into excel or similar to make search and filtering easier
If you are having issues with that, I have created an excel version of this file for you which you can download it from the bottom of this step.
Step 2: Understand the columns
  • The database overview is made up of many columns, each with different information about the AMR-related genes and alleles which are in the rows.
  • The database overview file has many columns but the ones we care about are:
Gene_family: the name of the gene that relates to that row entry
Product_name: What protein is created by that gene
Class: the general type of antimicrobial that this gene causes resistance to
Subclass: the specific type of antimicrobial that this gene causes resistance to
Pubmed_reference: the publication that is the underlying evidence for this genotype-phenotype link
Db_vesion: What is the version of the NCBI website you are looking at (important when writing your methods section)
  • Ensure you are familiar with the different columns
Step 3: Search for the blaKPC gene
  • Use either the search function in excel or the filter function to find in the column gene_family the gene blaKPC
All allele entries for this gene should be together in the file so ensure you are looking at the different rows, not just the first one for this gene.
  • In the next step you can answer the questions about this gene, so when you are ready, click Next button.
© Wellcome Connecting Science
This article is from the free online

Antimicrobial Databases and Genotype Prediction: Data Sharing and Analysis

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now