About The Mount Sinai Data Warehouse

The Mount Sinai Data Warehouse (MSDW) collects clinical and operational data for use in clinical and translational research, as well as quality and improvement initiatives.


Mount Sinai Data Warehouse Ecosystem

Click here to read more about data sources, transactions, and comprehensive descriptions of data in the Data Warehouse. MSDW is approved by the IRB.

For more resources, presentations, training materials, and documentation, click here.



The MSDW provides researchers access to data on patients in the Mount Sinai Health System:

  • Over 11 million patient records
  • Over 87 million patient encounters

The majority of the data collected by the MSDW comes from the Epic Clarity and Caboodle databases, as Epic is the primary electronic health record (EHR) across the Mount Sinai Health System (MSHS).

Click here for more details on MSDW’s OMOP Data Contents.

Mount Sinai Facility

Epic Go-Live

Mount Sinai Hospital (MSH) 2011
Mount Sinai Queens 2013
Mount Sinai West 2018
Mount Sinai Morningside 2018
Mount Sinai Brooklyn 2019
Mount Sinai Beth Israel 2020


The clinical data are extracted from Mount Sinai’s Epic Caboodle database and other ancillary systems, transformed into the OMOP Common Data Model (CDM) format, and loaded to the MSDW database. These data are updated daily.

The MSDW is situated on the Minerva High-Performance Computing (HPC) cluster alongside other research data sets.

Supported by grant UL1TR004419 from the National Center for Advancing Translational Sciences, National Institutes of Health.