The CBIPM-BioMe Data


The CBIPM (Charles Bronfman Institute for Personalized Medicine) data set hosted under Data Ark currently includes the BioMe GSA (Regeneron) and GDA (Sema4) microarray data, whole exome sequencing data (Regeneron), and the BioMe Epic EHR data mart. All data are de-identified.


For details on the study design and quality control, please review the following documents carefully:

The GSA (Regeneron) and GDA (Sema4) microarray Document

Regeneron Whole Exome Sequencing Data Document

BioMe Epic EHR data mart – Link coming soon

Here is the CBIPM data folder tree structure on Minerva:


├── datamart
│   └── BioMe_De_Identified
├── Microarray
│   └── combined
│       ├── genotyped_TOPMED_V2
│       ├── imputed_1kg_V2
│       └── imputed_TOPMED_V2
└── WES
    └── regeneron
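
As a quick sanity check once access has been granted, the folder layout above can be verified with a short script. This is only a minimal sketch: the function name `check_cbipm_layout` and the list `CBIPM_SUBDIRS` are hypothetical names chosen for illustration, and the root path of the CBIPM folder on Minerva is not stated on this page, so it must be supplied by the user.

```python
from pathlib import Path
import sys

# Sub-folders taken from the tree shown above, relative to the CBIPM root.
# The root itself is NOT given on this page and is left to the caller.
CBIPM_SUBDIRS = [
    "datamart/BioMe_De_Identified",
    "Microarray/combined/genotyped_TOPMED_V2",
    "Microarray/combined/imputed_1kg_V2",
    "Microarray/combined/imputed_TOPMED_V2",
    "WES/regeneron",
]


def check_cbipm_layout(root):
    """Return {relative_path: bool} saying whether each expected folder exists."""
    root = Path(root)
    return {sub: (root / sub).is_dir() for sub in CBIPM_SUBDIRS}


if __name__ == "__main__":
    # Usage: python check_layout.py <path-to-CBIPM-root>
    # Falls back to the current directory when no path is given.
    root = sys.argv[1] if len(sys.argv) > 1 else "."
    for sub, present in check_cbipm_layout(root).items():
        status = "ok" if present else "missing"
        print(f"{status:8s}{sub}")
```

A folder reported as missing may simply mean that the Data Use Agreement has not yet been processed for your account, or that the root path passed to the script is not the CBIPM folder.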



This data set is open only to Mount Sinai researchers, staff, and faculty. To use these data, you must read, agree to, and sign the Data Use Agreement (you must be logged in through the Mount Sinai campus network or the secure remote VPN).


More information

For more information, please visit the CBIPM website. If you still have questions about the CBIPM data set, please contact

