biological-databases
This is an old revision of the document!
Biological Sequence Databases on the HPC
Some of the most common biological sequence databases are available on the HPC for you to use with tools like BLAST. Below you can find the list of them, their location on the system, and the last time they were updated.
We endeavor to keep this list updated as the One True List™.
Name | Version Number | Last Updated¹ | Where it Resides | How to Use |
---|---|---|---|---|
NCBI nr/nt nucleotide collection | N/A | 2018-11-24 | /export/data/bio/ncbi/blast/db | Use BLASTDB=/export/data/bio/ncbi/blast/db and blastn … -db nt in your Bash script |
NCBI nr/nt protein collection | N/A | 2018-08-16 | /export/data/bio/ncbi/blast/db | use BLASTDB=/export/data/bio/ncbi/blast/db and blastp … -db nr in your Bash script |
UniProt's UniProtKB/Swiss-Prot (manually curated, most reliable) | N/A | ? | /export/data/bio/uniprot/blast/db | use BLASTDB=/export/data/bio/uniprot/blast/db in your Bash script |
UniProt's UniProtKB/TrEMBL (automated curation) | N/A | ? | /export/data/bio/uniprot/blast/db | use BLASTDB=/export/data/bio/uniprot/blast/db in your Bash script |
UniProt's UniRef100 | N/A | ? | /export/data/bio/uniprot/blast/db | use BLASTDB=/export/data/bio/uniprot/blast/db in your Bash script |
Footnotes
¹ Use the following to determine the date of a BLAST database: /export/apps/blast/2.7.1+/bin/blastdbcmd -info -db nt | grep Date
biological-databases.1552506545.txt.gz · Last modified: 2019/03/13 19:49 by aorth