Data Ark is hosting the NCBI BLAST database at the users’ request. This is a frequently updated database and the Data Ark team will manage and update the database regularly.
The nr database is downloaded by using the update_blastdb.pl script from the BLAST software, which is a preformatted version. For more information, click here.
The UniProt Reference Clusters (Uniref) have been formatted in a similar way, Data Ark hosts UniRef50, UniRef90, and UniRef100. For more information, click here.
Also hosted on the Data Ark is the Uniclust database; for more information, click here.
To use this data, NO DUA form is required. Access the data at the following path on Minerva –/sc/arion/projects/data-ark/Public_Unrestricted/BLAST or load module $ module load dataark to see the path variables.
To suggest new data sets, join our Data Ark Slack channel at https://join.slack.com/t/data-ark/signup and sign up using your Mount Sinai credentials.
Public Data Sets (unrestricted)
Public Data Sets (restricted)
Mount Sinai Generated Data (unrestricted)
Mount Sinai Generated Data (restricted)
School-Acquired Data Sets (restricted)
Data Set Supplements (restricted)