BLAST 

Data Ark hosts the NCBI BLAST database at users’ request. Because this database is updated frequently, the Data Ark team manages and updates the hosted copy regularly.

The nr database is a preformatted version, downloaded with the update_blastdb.pl script that comes with the BLAST software. For more information, click here.
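For reference, a download of this kind can be sketched as follows. This is a minimal illustration, not the Data Ark team's actual procedure: the helper name build_update_command is hypothetical, and the sketch assumes only the --decompress option of update_blastdb.pl as documented by NCBI. Users of Data Ark do not need to run this, since the data is already hosted.

```python
import shlex


def build_update_command(db_name="nr", decompress=True):
    """Build the argument list for NCBI's update_blastdb.pl script.

    Hypothetical helper for illustration only: it assembles the command
    and does not download anything.
    """
    cmd = ["update_blastdb.pl"]
    if decompress:
        # --decompress unpacks the archives after they are downloaded
        cmd.append("--decompress")
    cmd.append(db_name)
    return cmd


if __name__ == "__main__":
    # Print the command so it can be inspected before running it
    print(shlex.join(build_update_command()))
```

To actually run the download, the returned list could be passed to subprocess.run(cmd, check=True) from a directory with enough free space for the database.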

The UniProt Reference Clusters (UniRef) have been formatted in a similar way. Data Ark hosts UniRef50, UniRef90, and UniRef100. For more information, click here.

Also hosted on the Data Ark is the Uniclust database; for more information, click here.

 

No DUA form is required to use this data. On Minerva, access the data at /sc/arion/projects/data-ark/Public_Unrestricted/BLAST, or load the module with $ module load dataark to see the path variables.
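As an illustration of using the hosted data, the sketch below assembles a blastp command that points at the Data Ark path given above. The database name nr directly under the BLAST directory, the file names query.fasta and hits.tsv, and the helper build_blastp_command are assumptions made for the example. List the directory on Minerva to confirm the actual layout before running anything.

```python
import shlex
from pathlib import PurePosixPath

# Path given on this page for the BLAST data on Minerva
BLAST_DIR = PurePosixPath("/sc/arion/projects/data-ark/Public_Unrestricted/BLAST")


def build_blastp_command(query, out, db_name="nr", threads=4):
    """Assemble a blastp invocation against a database under BLAST_DIR.

    Hypothetical helper: db_name is assumed to sit directly under
    BLAST_DIR, which may not match the real directory layout.
    """
    db = BLAST_DIR / db_name
    return [
        "blastp",
        "-query", query,                # protein sequences in FASTA format
        "-db", str(db),                 # database path without file extension
        "-out", out,                    # file that receives the results
        "-outfmt", "6",                 # tabular output
        "-num_threads", str(threads),
    ]


if __name__ == "__main__":
    # Print the command for inspection; nothing is executed here
    print(shlex.join(build_blastp_command("query.fasta", "hits.tsv")))
```

The command is only printed, so the sketch can be checked without BLAST installed. Running it for real requires blastp to be available on your PATH.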

To suggest new data sets, join our Data Ark Slack channel at https://join.slack.com/t/data-ark/signup and sign up using your Mount Sinai credentials.

