MSDW Services

MSDW has Epic Electronic Health Record data from the Mount Sinai Health System, which includes demographics, encounters, labs, conditions, diagnoses, medications and more. Data is transformed from Epic into OMOP CDM and is updated daily. Reference OHDSI’s data dictionary for OMOP CDM. Custom data requests (which can include PHI) may be requested through the MSDW Data Request Form. To initiate the custom data request process, please submit a request through the MSDW ticketing system.

All data queries and tools are designed to meet the requirements of federal HIPAA law with Protected Patient Exclusion Criteria.


The Mount Sinai Data Warehouse provides researchers access to data on the over 12 million unique patients in the Mount Sinai Health System Epic Electronic Health Record (EHR). Data containing PHI is accessible through approved IRB documentation

MSDW Custom Data Set COVID-19 Research Data Data Marts
Description Work with MSDW analysts to compose custom SQL queries for research or QA.  See our process. The de-identified COVID-19 data sets include all patients at a Mount Sinai facility who have been screened for or diagnosed with COVID-19. The identified version of this data set is available with appropriate documents. Data is refreshed weekly. See more details. An MSDW Data Mart is a curated, customized data set with a specific focus. There are two categories of Data Marts: Analyst-generated curated data marts (e.g. Liver data), and User-requested data marts (i.e. your custom MSDW data request).
Data Types Patient Records with Encounters, Clinical Data, Diagnoses (ICD-9 and ICD-10), Medication Records (prescription and medication administrations), Test Results (including radiology and pathology), Clinical Documentations (including operative reports, discharge summaries). Click here for more information on MSDW data. Patient Encounters (Demographics, Comorbidities, Outcomes), Vital Signs, Labs, Medication Administrations, and Radiology Report Impressions. COVID-19 vaccine immunization file added in February 2021  
Access Request Custom Data Set form Download De-Identified Data Navigate to the MSDW Service Desk and submit a ticket for “Ask a Question.” Ticket must include an attached signed MSDW Database Access Agreement and IRB documentation (if applicable).
Cost $180/hour De-identified data is available to Mount Sinai Researchers. Researchers seeking a custom COVID-19 data set, requests are subject to a fee of $180/hour. Submit a request for custom data here. $180/hour
PHI Yes Yes Yes
Turnaround Time Typically a few weeks (depending on complexity) Download in seconds One to a few days (depending on complexity)
Advantages Most comprehensive and sophisticated way to search the largest database at Mount Sinai Mount Sinai COVID-19 data sets are refreshed weekly High-quality research data is curated by our Clinical Informaticists; data quality is maximized by professional processing and repeat use; expanded opportunities for collaboration and citations


Cohort Query Tools

Self-service query tools can be utilized to query de-identified MSDW data directly. For additional information on the data services offered, Digital Concierge hours are hosted weekly. Compare Leaf and ATLAS tools directly in their similarities, inclusion criteria, and capabilities.

Leaf ATLAS TriNetX
Description Web-based, lightweight drag-and-drop cohort query tool that quickly analyzes population demographics A web-based cohort query tool for database exploration, standardized vocabulary browsing, cohort definition, and patient cohort-level analysis A web-based cohort query tool
Access Use your Mount Sinai network username/password to login. Use your Mount Sinai network username/password to login Request access here. Log in to the TriNetX system using your email address and password.
Training Written Tutorial; PEAK Tutorial Written Tutorial; PEAK Tutorial; Videos PEAK Tutorial
Data Types

Patient demographics, diagnoses, procedures, medications, labs, orders, vitals, institutional patient cohorts (BioMe, IRW, etc.)

Patient demographics, diagnoses, procedures, medications, labs, orders, vitals

Patient demographics, diagnoses, procedures, medications, labs, orders, vitals

PHI No Yes, if IRB Approved De-identified data only
Cost No charge No charge No charge
Application Status Leaf Status and Roadmap ATLAS Status and Roadmap  
Advantages Can visualize demographic details of cohorts, drag-and-drop query feature; download de-identified patient cohort list Utilizes common data model and queries Offers a polished, commercially developed user interface

Protected Patient Information

In compliance with HIPAA privacy and security, all data queries exclude protected patient categories. These records are excluded from de-identified OMOP data sets and from PHI data sets unless explicitly approved by the investigator’s IRB (42 CFR Part 46). Click here for more information about Protected Patient Categories.

We are supported by grant UL1TR004419 from the National Center for Advancing Translational Sciences, National Institutes of Health.