guppy-software
guppy
Guppy is a data processing toolkit that contains Oxford Nanopore Technologies' basecalling algorithms and several bioinformatic post-processing features.
See this tutorial: https://denbi-nanopore-training-course.readthedocs.io/en/latest/basecalling/basecalling.html
Information
- Version: 6.5.7
- Added: July, 2021
- Updated: December, 2023
Usage
Guppy is only available on compute06 because this is the only node that has a GPU.
Note: guppy ships with some pre-configured models that set many basecalling parameters to sensible defaults. The model configuration files are in /opt/ont/guppy/data; pass one to guppy_basecaller with -c.
Installation
Notes from the sysadmin during installation.
$ sudo dnf install https://hpc.ilri.cgiar.org/~jbaka/ONT-Dec2023/Software/ont-guppy-6.5.7-1.el8.x86_64.rpm
Run some test data using fourteen CPUs:
$ mkdir -p /var/scratch/aorth/2022-05-24-guppy
$ cd /var/scratch/aorth/2022-05-24-guppy
$ wget https://openstack.cebitec.uni-bielefeld.de:8080/swift/v1/nanopore_course_data/Data_Group1.tar.gz
$ tar xf Data_Group1.tar.gz
$ guppy_basecaller --compress_fastq -i data/fast5_tiny/ -s data/basecall_tiny/ --cpu_threads_per_caller 14 --num_callers 1 -c dna_r9.4.1_450bps_hac.cfg
...
Caller time: 3281435 ms, Samples called: 27619692, samples/s: 8416.96
Run some test data using a GPU (currently only on compute06):
$ srun --gres=gpu:v100:1 --partition=batch -w compute06 -n 1 -J guppy --pty bash
$ guppy_basecaller --device "cuda:0" --compress_fastq -i data/fast5_tiny/ -s data/basecall_tiny/ -c dna_r9.4.1_450bps_hac.cfg
...
Caller time: 4366 ms, Samples called: 27619692, samples/s: 6.32609e+06
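The two "Caller time" lines above can be compared directly. The following is a minimal sketch, not part of guppy, that recomputes samples/s from the reported caller time and sample count and derives the GPU speedup over the fourteen-thread CPU run. The helper names throughput and speedup are hypothetical and exist only for this illustration; the numbers are copied from the two test runs.

```shell
#!/bin/sh
# Hypothetical helpers (not part of guppy) for reading the summary lines above.

# throughput SAMPLES CALLER_TIME_MS -> samples per second
throughput() {
    awk -v s="$1" -v ms="$2" 'BEGIN { printf "%.2f\n", s / (ms / 1000) }'
}

# speedup SLOW_MS FAST_MS -> how many times faster the second run was
speedup() {
    awk -v slow="$1" -v fast="$2" 'BEGIN { printf "%.1f\n", slow / fast }'
}

# Figures copied from the CPU and GPU test runs.
samples=27619692
cpu_ms=3281435
gpu_ms=4366

echo "CPU samples/s: $(throughput "$samples" "$cpu_ms")"
echo "GPU samples/s: $(throughput "$samples" "$gpu_ms")"
echo "GPU speedup:   $(speedup "$cpu_ms" "$gpu_ms")x"
```

On this test data set the GPU run finishes roughly 750 times faster than the CPU run, which is why basecalling should normally be done in a GPU job on compute06.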
guppy-software.txt · Last modified: 2023/12/06 06:42 by aorth