User Tools

Site Tools


biological-databases

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
biological-databases [2019/03/12 12:53]
jean-baka
biological-databases [2019/07/01 12:20] (current)
aorth
Line 1: Line 1:
-====== Biological ​sequence databases ​on the HPC ======+====== Biological ​Sequence Databases ​on the HPC ======
  
-Some of the most common biological sequence databases are available on the HPCfor you to use with tools like BLAST, etc. Below you can find the list of them, their source URL and the last time they were updated, with links to the update scripts that our system administrator use to perform updates.+~~NOTOC~~ 
 + 
 +Some of the most common biological sequence databases are available on the HPC for you to use with tools like BLAST. Below you can find the list of them, their location on the system, ​and the last time they were updated.
  
 We endeavor to keep this list updated as the One True List™. We endeavor to keep this list updated as the One True List™.
  
 +^Name        ^Comments ​  ​^Updated¹ ​             ^Database Location ​         ^
 +| nr/nt | NCBI nucleotide collection | 2019-06-20 | ''/​export/​data/​bio/​ncbi/​blast/​db''​ |
 +| nr/nt | NCBI protein collection | 2019-06-20 | ''/​export/​data/​bio/​ncbi/​blast/​db''​ |
 +| UniProt'​s UniProtKB/​Swiss-Prot | Manually curated, most reliable | 2019-07-01 | ''/​export/​data/​bio/​uniprot/​blast/​db''​ |
 +| UniProt'​s UniProtKB/​TrEMBL | Automated curation | ? | ''/​export/​data/​bio/​uniprot/​blast/​db''​ |
 +| UniProt'​s UniRef100 | | ? | ''/​export/​data/​bio/​uniprot/​blast/​db''​ |
 +
 +==== Using These Databases ====
 +To use these databases you generally need to set an environment variable pointing to the location of the database before running your program. For example, to use ''​nt''​ with NCBI ''​blastn'':​
  
-^Name        ^Version number ​  ^Last updated ​               ^Where it resides ​             ^ +<​code>​ 
-|NCBI nr/nt collection|2.1.4|end Nov 2018|''​/​export/​data/​bio/​ncbi/​blast/​db''​|+$ export BLASTDB=/​export/​data/​bio/​ncbi/​blast/​db 
 +$ blastn -db nt -query file.seq -out blast.out 
 +</​code>​
  
 +==== Notes ====
 +¹ Use the following to determine the date of a BLAST database: ''/​export/​apps/​blast/​2.7.1+/​bin/​blastdbcmd -info -db nt | grep Date''​
biological-databases.1552384401.txt.gz · Last modified: 2019/03/12 12:53 by jean-baka