User Tools

Site Tools


tassel-software

This is an old revision of the document!


Tassel

  • Version 3.0.165
  • Added August, 2014
  • One of the scientist requested us to install a new software known as tassel for evaluating traits associations, evolutionary patterns, and linkage disequilibrium.

Usage

  • Load tassel module
  • module load tassel/3.0.165

Memory Usage/Settings

  • Tassel uses java with a default memory setting of: -Xms512m -Xmx1536m
  • So you have to increase the maximum memory(-Xmx) to atleast 4GB otherwise tassel would run out of memory really fast
  • Use the following options to set the maximum memory size to 4GB:
    -Xms512m -Xmx4096m
  • For example:
    run_pipeline.pl -Xms512m -Xmx4096m

Installation

  • Download/clone tassel repo, in /export/apps/, create tassel/ folder & in it create tassel_version_number folder in this case it would be 3.0.165.
  • Copy over the binaries to /export/apps/tassel/tassel_version_number/ i.e. /export/apps/tassel/3.0.165/
  • Clone hpc-environment-modules from github
  • cd to hpc-environment-modules git repo & mkdir tassel, in it create the module file & name it the version number i.e. 3.0.165
  • Commit & push to github
  • Run ansible & limit to compute nodes & modules tag
  • ssh hpc
    git clone git://git.code.sf.net/p/tassel/tassel3-standalone tassel3
    cd tassel3/
    git checkout V3.0.165
    sudo su -
    mkdir -p /export/apps/tassel/3.0.165/
    cp -rv * /export/apps/tassel/3.0.165/
  • Clone environment modules & add a new module
  • git clone https://github.com/ilri/hpc-environment-modules.git
    cd hpc-environment-modules/
    mkdir /
    vim tassel/3.0.165
  • tassel module file
  • #%Module1.0#####################################################################
    ##
    ## tassel modulefile
    ## By James Oguya <j.oguya@cgiar.org>
    ## August, 2014
    ##
    
    module-whatis "Tassel is a software package to evaluate traits associations, evolutionary patterns, and linkage disequilibrium. http://www.maizegenetics.net/tassel/"
    
    # for TCL script use
    set     version         3.0.165
    set     appname         tassel
    set     prefix          /export/apps/${appname}/${version}
    set     exec_prefix     ${prefix}
    
    conflict tassel
    
    prepend-path    PATH            ${exec_prefix}
    
    conflict tassel
    
    # tassel depends on perl 5.16.3, load the perl module as well
    if { ![is-loaded perl] } {
            module load perl
    }
  • Deploy new module using ansible
  • ansible-playbook site.yml -i private/hosts --limit compute --tags=modules -u joguya -K
  • Test to confirm success: do a module avail, module load tassel/3.0.165
  • module avail
    module load tassel/3.0.165
    module ls
    run_anything.pl
    module unload
  • Sample script written by Maria Onyango
  • #!/bin/env bash
    
    #SBATCH -p batch
    #SBATCH -J tassel
    #SBATCH -n 10
    
    current_date=$(date +%b_%d)
    
    DATADIR="/home/monyango/data/cullicoides_br/$current_date"
    
    # create current_day folder
    mkdir $DATADIR
    
    #create UNEAK folder in $DATADIR
    mkdir ${DATADIR}/UNEAK
    
    #load tassel module
    module load tassel/3.0.165
    
    #sequence keys
    illumina_sequence_data='/home/monyango/data/cullicoides_br/Co_s_5_sequence.txt.gz'
    gbs_barcode_keys='/home/monyango/data/cullicoides_br/GBS_barcode_key.txt'
    
    # step 1: create workingdirplugin - create several folders in DATADIR/UNEAK
    run_pipeline.pl -Xmx4096m -fork1 -UCreatWorkingDirPlugin -w ${DATADIR}/UNEAK -endPlugin -runfork1
    
    # copy sequence data to DATADIR/UNEAK/Illumina & key to DATADIR/UNEAK/key
    cp ${illumina_sequence_data} ${DATADIR}/UNEAK/Illumina/
    cp ${gbs_barcode_keys} ${DATADIR}/UNEAK/key/
    
    # step 2: derive tagcount list
    run_pipeline.pl -Xmx4096m -fork1 -UFastqToTagCountPlugin -w ${DATADIR}/UNEAK -e PstI -endplugin -runfork1
    
    # step 3: merge tag count files of the same taxon
    run_pipeline.pl -Xmx4096m -fork1 -UMergeTaxaTagCountPlugin -w ${DATADIR}/UNEAK -c 5 -endplugin -runfork1
    
    # step 4: identify tag pairs for snip calling
    run_pipeline.pl -Xmx4096m -fork1 -UTagCountToTagPairPlugin -w ${DATADIR}/UNEAK -e 0.03 -endplugin -runfork1
    
    # step 5: generate tags by taxa file
    run_pipeline.pl -Xmx4096m -fork1 -UTagPairToTBTPlugin -w ${DATADIR}/UNEAK -endplugin -runfork1
    
    # step 6: generate map info file
    run_pipeline.pl -Xmx4096m -fork1 -UTBTToMapInfoPlugin -w ${DATADIR}/UNEAK -endplugin -runfork1
    
    # step 7: output hapmap file
    run_pipeline.pl -Xmx4096m -fork1 -UMapInfoToHapMapPlugin -w ${DATADIR}/UNEAK -mnMAF 0.05 -mxMAF 0.5  -mnC 0 -mxC 1 -endplugin -runfork1
tassel-software.1423569079.txt.gz · Last modified: by joguya