MSDW holds Epic Electronic Health Record (EHR) data from the Mount Sinai Health System, including demographics, encounters, labs, conditions, diagnoses, medications, and more. Data is transformed from Epic into the OMOP Common Data Model (CDM) and is updated daily. See OHDSI’s data dictionary for the OMOP CDM. Custom data sets (which can include PHI) may be requested through the MSDW Data Request Form. To start the custom data request process, submit a request through the MSDW ticketing system.
All data queries and tools are designed to meet the requirements of federal HIPAA law with Protected Patient Exclusion Criteria.
The Mount Sinai Data Warehouse gives researchers access to data on more than 12 million unique patients in the Mount Sinai Health System Epic Electronic Health Record (EHR). Data containing PHI is accessible with approved IRB documentation.
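Because the data follows the OMOP CDM, research questions can usually be phrased as joins across its standard tables. The following is a minimal sketch of that idea, not a description of how MSDW is actually queried: it uses an in-memory SQLite database as a stand-in, a reduced subset of the `person` and `condition_occurrence` columns, synthetic rows, and concept IDs chosen purely for illustration. MSDW's database engine, schema names, and access method are not described on this page and are not assumed here.

```python
import sqlite3

# Synthetic stand-in for two OMOP CDM tables (reduced column set).
# No real patient data; this is NOT a connection to MSDW.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE person (
    person_id INTEGER PRIMARY KEY,
    gender_concept_id INTEGER,
    year_of_birth INTEGER
);
CREATE TABLE condition_occurrence (
    condition_occurrence_id INTEGER PRIMARY KEY,
    person_id INTEGER,
    condition_concept_id INTEGER,
    condition_start_date TEXT
);
""")
conn.executemany("INSERT INTO person VALUES (?, ?, ?)", [
    (1, 8507, 1960),
    (2, 8532, 1975),
    (3, 8532, 1988),
])
conn.executemany("INSERT INTO condition_occurrence VALUES (?, ?, ?, ?)", [
    (10, 1, 201826, "2020-01-15"),
    (11, 1, 201826, "2021-03-02"),  # same person, second record
    (12, 2, 201826, "2019-07-30"),
    (13, 3, 320128, "2022-11-11"),  # different condition concept
])

# Illustrative concept ID only; look up real IDs in the OMOP vocabulary.
EXAMPLE_CONDITION_CONCEPT_ID = 201826


def count_patients_by_gender(conn, condition_concept_id):
    """Count distinct persons with the given condition, per gender concept."""
    rows = conn.execute(
        """
        SELECT p.gender_concept_id, COUNT(DISTINCT p.person_id)
        FROM person AS p
        JOIN condition_occurrence AS co ON co.person_id = p.person_id
        WHERE co.condition_concept_id = ?
        GROUP BY p.gender_concept_id
        ORDER BY p.gender_concept_id
        """,
        (condition_concept_id,),
    ).fetchall()
    return dict(rows)


counts = count_patients_by_gender(conn, EXAMPLE_CONDITION_CONCEPT_ID)
print(counts)
```

Counting `DISTINCT person_id` matters because a patient can have many condition records; without it, the first synthetic patient would be counted twice.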
|–||MSDW Custom Data Set||COVID-19 Research Data||Data Marts|
|Description||Work with MSDW analysts to compose custom SQL queries for research or QA. See our process.||The de-identified COVID-19 data sets include all patients at a Mount Sinai facility who have been screened for or diagnosed with COVID-19. The identified version of this data set is available with appropriate documents. Data is refreshed weekly. See more details.||An MSDW Data Mart is a curated, customized data set with a specific focus. There are two categories of Data Marts: Analyst-generated curated data marts (e.g. Liver data), and User-requested data marts (i.e. your custom MSDW data request).|
|Data Types||Patient Records with Encounters, Clinical Data, Diagnoses (ICD-9 and ICD-10), Medication Records (prescriptions and medication administrations), Test Results (including radiology and pathology), Clinical Documentation (including operative reports and discharge summaries). Click here for more information on MSDW data.||Patient Encounters (Demographics, Comorbidities, Outcomes), Vital Signs, Labs, Medication Administrations, and Radiology Report Impressions. A COVID-19 vaccine immunization file was added in February 2021.|
|Access||Request Custom Data Set form||Download De-Identified Data||Navigate to the MSDW Service Desk and submit a ticket for “Ask a Question.” Ticket must include an attached signed MSDW Database Access Agreement and IRB documentation (if applicable).|
|Cost||$180/hour||De-identified data is available to Mount Sinai researchers. Requests from researchers seeking a custom COVID-19 data set are subject to a fee of $180/hour. Submit a request for custom data here.||$180/hour|
|Turnaround Time||Typically a few weeks (depending on complexity)||Download in seconds||One to a few days (depending on complexity)|
|Advantages||Most comprehensive and sophisticated way to search the largest database at Mount Sinai||Mount Sinai COVID-19 data sets are refreshed weekly||High-quality research data is curated by our Clinical Informaticists; data quality is maximized by professional processing and repeat use; expanded opportunities for collaboration and citations|
Cohort Query Tools
Self-service query tools can be used to query de-identified MSDW data directly. For additional information on the data services offered, Digital Concierge hours are hosted weekly. Compare the Leaf and ATLAS tools directly by their similarities, inclusion criteria, and capabilities.
|–||Leaf||ATLAS||TriNetX|
|Description||Web-based, lightweight drag-and-drop query tool that quickly analyzes population demographics using ICD-10 codes.||A web-based integrated platform for database exploration, standardized vocabulary browsing, cohort definition, and population-level analysis.||A global research network that optimizes clinical research for data at Mount Sinai.|
|Access||Use your Mount Sinai network username/password to log in.||Log in to the ATLAS System using your Sinai School network username/password. For users with only a Hospital account, request a School account through Sailpoint to access ATLAS.||Request access here. Log in to the TriNetX system using your email address and password.|
|Training||Tutorial||Tutorial, Videos||PEAK Tutorial|
|Data Types||Conditions (diagnoses), procedures, demographics, encounters (patient encounter locations), lab results, vitals, patient cohorts||Facilities, diagnoses, procedures, medications, labs, orders, patient demographics||Demographics, diagnoses, procedures, medications, labs, vitals|
|PHI||No||Yes, if IRB Approved|
|Application Status||Leaf Status and Roadmap||ATLAS Status and Roadmap|
|Turnaround||Seconds||Seconds||Seconds to minutes|
|Advantages/Disadvantages||Can visualize demographic details of cohorts; drag-and-drop query feature||Uses a common data model and queries; numerous query analysis tools available on GitHub||Platform uses real-time modeling and simulation of patient populations based on inclusion and exclusion criteria; contains MSDW patient data through June 2021; can search BioMe patients; can search Image Research Warehouse patients; can be slow to operate|
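The cohort tools compared above build patient cohorts from inclusion and exclusion criteria. As a rough illustration of that logic only, the sketch below treats each criterion as a set of matching patient IDs. The function name `build_cohort`, the criteria names, and the IDs are all hypothetical and synthetic; this does not reflect how Leaf, ATLAS, or TriNetX evaluate criteria internally.

```python
def build_cohort(candidates, inclusion, exclusion):
    """Return sorted IDs that meet every inclusion criterion and no exclusion criterion.

    Each criterion is given as the set of patient IDs that match it.
    """
    cohort = set(candidates)
    for matching_ids in inclusion:
        cohort &= matching_ids  # must satisfy every inclusion criterion
    for matching_ids in exclusion:
        cohort -= matching_ids  # drop anyone matching an exclusion criterion
    return sorted(cohort)


# Synthetic patient IDs, for illustration only.
all_patients = {1, 2, 3, 4, 5, 6}
has_diagnosis = {1, 2, 3, 5}
has_lab_result = {2, 3, 5, 6}
on_excluded_medication = {3}

cohort = build_cohort(
    all_patients,
    inclusion=[has_diagnosis, has_lab_result],
    exclusion=[on_excluded_medication],
)
print(cohort)  # [2, 5]
```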
Protected Patient Information
In compliance with HIPAA privacy and security requirements, all data queries exclude protected patient categories. These records are excluded from de-identified OMOP data sets and from PHI data sets unless explicitly approved by the investigator’s IRB (42 CFR Part 46). Click here for more information about Protected Patient Categories.
We are supported by grant UL1TR004419 from the National Center for Advancing Translational Sciences, National Institutes of Health.