Skip main navigation

New offer! Get 30% off one whole year of Unlimited learning. Subscribe for just £249.99 £174.99. New subscribers only. T&Cs apply

Find out more

Using AI variant interpretation tools

Article with an introduction to the main artificial intelligence tools applied to variants interpretation

In recent years, state-of-the-art artificial intelligence (AI) has been applied to improve the variant interpretation process. Implementing AI algorithms is especially helpful when a large amount of data needs to be processed to arrive at an answer – therefore making it very applicable to variant analysis. Using AI-enabled tools for variant interpretation might also be helpful when working in a resource-limited setting, or on interpreting variants from diverse populations where the data resources might be sparse and sporadic.

AI can be subdivided into different fields, including natural language processing, machine learning and deep learning. Different tools may use the same or different AI processes. It is important to consider that there isn’t an optimal approach for everything – one should consider the mechanisms of the tools and the purpose of the research to choose an approach accordingly.

DeepVariant (developed by Google) and DNAscope are two examples of AI-based variant callers that may be used instead of more conventional tools, such as BCFTools. A recent evaluation of these tools showed that the AI-based tools outperformed conventional tools for calling SNVs and INDELs. These results held true for both short-read and long-read generation sequencing data. AI-driven variant prioritisation pipelines have also been shown to be a highly efficient approach that can accelerate the discovery of clinically relevant genetic variants. Franklin by Genoox is an example of an extensive AI-based evidence network of genomic and clinical data, which is available freely to the scientific community.

AI-based tools for detection and prioritisation of disease-causative variants

Variant calling

Single Nucleotide Variants (SNPs)/indels

Tools Methods Launch year
GATK HMM, Bayesian, etc. 2010
FreeBayes Bayesian 2012
DeepVariants Deep CNN 2018

HMM: Hidden Markov Model; CNN: Convolutional Neural Networks

Copy Number Variants (CNVs)

Tools Methods Launch year
PennCNV HMM 2007
CN-Learn Random Forest 2019
CNV-JACG Random Forest 2020
DeepCNV Deep CNN 2021

Variant prioritising

Coding variants

Tools Methods Launch year
CADD SVM 2014
REVEL Random Forest 2016

SVM: Support Vector Machine

Splicing variants

Tools Methods Launch year
SpliceAI Deep CNN 2019
MMSplice Deep CNN 2019

Regulatory variants

Tools Methods Launch year
DeepSEA Deep CNN 2015
HeartENN Deep CNN 2020
MARVEL GLM-LARS 2020

GLM-LARS: Generalised Linear Model-based Least Angle Regression

Currently, there are AI tools to perform each step of the variant interpretation workflow (as illustrated in the Table above). Choosing which tool to use may depend on your setting and resources. These tools may be especially useful when you have limited bioinformatics support, as they are user-friendly and intuitive. Some tools are proprietary software, which means that there is a cost involved in using these tools, another important consideration, especially in resource-constrained environments. It is important to consider that AI tools are only as good as the data available to train these tools – AI is therefore also limited by the lack of diversity in data available for variant interpretation.

Use the comments to share other AI tools you may know and your experience with this technology for genomics.

© Wellcome Connecting Science
This article is from the free online

Interpreting Genomic Variation: Overcoming Challenges in Diverse Populations

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now