Skip main navigation

New offer! Get 30% off one whole year of Unlimited learning. Subscribe for just £249.99 £174.99. New subscribers only. T&Cs apply

Find out more

Square bins, missing plots, and cumulative plots

Missing BTK plots and cumulative plots

Square bins

When you first open any genome assembly in the BTK viewer, you should see a blob plot that looks like this BTK view with circles

However, sometimes you see a plot with square bins rather than circles: BTK view with bins

The reason is that in some genome assemblies there are hundreds of thousands of small contigs, and displaying them as individual interactive objects can slow down or even crash the web browser. By default, BTK viewer shows these assemblies as “binned” blob plots – with individual bins in the shape of squares. The binned blob plot is a very powerful tool for examining large numbers of contigs interactively. In fact, one of the main innovations of the BTK viewer is the ability to visualise thousands of separate taxonomic assignments interactively using these binned plots. You can use the Settings main menu to change the plot shape back to a circle, as you saw in the screencast video in step 2.6.

Missing Blob Plots and cumulative plots

In many cases, such as the figure below, there is no read data directly linked with the genome assembly in the public databases. In these cases, the BTK pipeline is unable to automatically calculate the sequencing depth and show that on the Y axis. If you are a researcher with a genome assembly like this, please contact blobtoolkit@genomehubs.org with the publicly available read accession numbers for a given genome assembly and the pipeline will be rerun for that genome.

In cases like this, where BTK has not yet calculated sequencing depth, the default view is the “Cumulative” plot, which shows the number of contigs in the assembly on the X axis (ordered from longest to shortest), and the cumulative length of the genome assembly for those contigs on the Y axis.

BTK view cumulative plot

In this plot for the milkfish Chanos chanos, the cumulative length plot rises sharply for the first 16 contigs, reaching a genome span of just over 650 Million bases, and the remaining 14 sequences (from number 17 to 30) in the assembly are very short, so the cumulative length plot is almost flat.

© Wellcome Connecting Science
This article is from the free online

Eukaryotic Genome Assembly: How to Use BlobToolKit for Quality Assessment

Created by
FutureLearn - Learning For Life

Reach your personal and professional goals

Unlock access to hundreds of expert online courses and degrees from top universities and educators to gain accredited qualifications and professional CV-building certificates.

Join over 18 million learners to launch, switch or build upon your career, all at your own pace, across a wide range of topic areas.

Start Learning now