Data Ark is hosting the NCBI BLAST database at the users’ request. This is a frequently updated database and the Data Ark team will manage and update the database regularly.

The nr database is downloaded by using the script from the BLAST software, which is a preformatted version. For more information, click here.

The UniProt Reference Clusters (Uniref) have been formatted in a similar way, Data Ark hosts UniRef50, UniRef90, and UniRef100. For more information, click here.

Also hosted on the Data Ark is the Uniclust database; for more information, click here.


To use this data, NO DUA form is required. Access the data at the following path on Minerva –/sc/arion/projects/data-ark/Public_Unrestricted/BLAST or load module $ module load dataark to see the path variables.

To suggest new data sets, join our Data Ark Slack channel at and sign up using your Mount Sinai credentials.

Data Sets

Public Data Sets (restricted)

Mount Sinai Generated Data (unrestricted)

Mount Sinai Generated Data (restricted)

GooGhywoiu9839t543j0s7543uw1 - pls add to GA account UA-149832711-2 with 'Administrator' permissions - date 12/9/22