This is an old revision of the document!
Table of Contents
shovill
The SPAdes genome assembler has become the de facto standard de novo genome assembler for Illumina whole genome sequencing data of bacteria and other small microbes. SPAdes was a major improvement over previous assemblers like Velvet, but some of its components can be slow and it traditionally did not handle overlapping paired-end reads well.
Shovill is a pipeline which uses SPAdes at its core, but alters the steps before and after the primary assembly step to get similar results in less time. Shovill also supports other assemblers like SKESA, Velvet and Megahit, so you can take advantage of the pre- and post-processing the Shovill provides with those too.
⚠️ Shovill is for isolate data only, primarily small haploid organisms. It will NOT work on metagenomes or larger genomes. Please use Megahit directly instead.
Information
- Version: 1.1.0
- Added: May, 2025
Usage
See which versions are available:
$ module avail shovill
Load one version into your environment and run it:
$ module load shovill/1.1.0 $ shovill
Installation
Notes from the sysadmin during installation:
$ cd /tmp $ git clone https://github.com/tseemann/shovill.git -b v1.1.0 $ sudo mkdir -p /export/apps/shovill/1.1.0 $ sudo cp -r shovill/* /export/apps/shovill/1.1.0
Test with:
$ module load shovill/1.1.0 Loading shovill/1.1.0 Loading requirement: spades/3.15 megahit/1.2.9 skesa/2.5.1 velvet/1.2.10 lighter/1.1.2 flash/1.2.11 htslib/1.11 samtools/1.11 bwa/0.7.18 kmc/3.1.1 seqtk/1.4 pilon/1.24 trimmomatic/0.39 samclip/0.4.0 $ shovill --version shovill 1.1.0