The pipeline, based on the program GeneWise, is designed to analyze an annotated genome and automatically identify missed gene calls and sequence variants such as genes with disrupted reading frames (split genes) and those with insertions and deletions (indels). For a given genome to be analyzed, GenVar relies on a database containing closely related genomes (such as other species or strains) as well as a few additional reference genomes. GenVar also helps identify gene disruptions probably caused by sequencing errors. We exemplify GenVar's capabilities by presenting results from the analysis of four Brucella genomes. Brucella is an important human pathogen and zoonotic agent. The analysis revealed hundreds of missed gene calls, new split genes and indels, several of which are species specific and hence provide valuable clues to the understanding of the genome basis of Brucella pathogenicity and host specificity. http://nar.oxfordjournals.org/cgi/content/abstract/gkm377v1
Get the right version from http://patric.vbi.vt.edu/downloads/software/GenVar/
Extract the content into the desired directory
tar xf GenVar2.0.tar
Create a symbolic link to the /usr/bin directory
ln -s /mnt/export/paracel/biosoft/genvar/GenVar2.2.4_beta/genvar.pl /usr/bin/
for all the nodes:
rocks run host "ln -s /mnt/export/paracel/biosoft/genvar/GenVar2.2.4_beta/genvar.pl /usr/bin/"