Scientific Computing and Data / Mount Sinai Data Warehouse / COVID-19 Research Data Set
COVID-19 Electronic Health Record (EHR) Research Data Set
In March 2020, Mount Sinai Data Warehouse team created a near real-time de-identified COVID-19 data set. This data set has evolved to contain over 400 data elements and is updated weekly. The cohort includes patients with an encounter at a Mount Sinai facility who have been diagnosed with COVID-19, those who are under investigation for COVID-19, as well as those who have screened negative for COVID-19. The data set consists of multiple files (patient encounter, lab tests, medication administrations, diagnoses, vitals, radiology and immunizations) all of which are linked together using a masked medical record number and a unique patient encounter key.
Category | Includes |
Patient Encounters: Demographics | Age, Sex, Race, Ethnicity, Zip Code, Preferred Language, Insurance |
Patient Encounters: Outcomes | Mortality, AKI, VTE, Stroke, AMI |
Patient Encounters: Comorbidities | Asthma, COPD, CKD, HTN, Obesity, DM, HIV, Smoking, Cancer, CAD, CHF, OSA, A Fib, Liver Disease, UC, Crohn’s |
Patient Encounters: Encounter | Encounter Type, Department, Location of Care, Admission, Discharge, ED, ICU, Inpatient (Non ICU) |
Patient Encounters: Radiology Report Impressions | Chest x-rays, Chest CT scans |
Vital Signs | BMI, Max Temp, Max Heart Rate, Min Heart Rate, Max Respiratory Rate, Min Respiratory Rate, Max Blood Pressure, Min Blood Pressure, Min O2 Saturation |
Lab Tests | SARS-CoV-2 PCR, Antibody Assay, IL-6, IL-8, IL-1 Beta, TNF Alpha, D-Dimer, Ferritin, LDH, Fibrinogen, C-Reactive Protein, Procalcitonin, WBC, RBC, Blood Culture, AST, ALT, Alk Phos, Na, K, Cl, Calcium, eGFR, Bun, Creatinine, Hemoglobin, Lymphocyte, Eosinophil, Basophil, Monocyte, Neutrophil, Platelet, Hematocrit, aCL Antibody, B2GPI Antibody, Phospholipid Antibody, ESR, Haptoglobin, HbA1c, Oxyhemoglobin, Deoxyhemoglobin, Methemoglobin, Haptoglobin, BNP, MCV, MCH, MCHC, MPV, PIT, PT, INR, Albumin, Uric Acid, Bilirubin, pH, Anion Gap, pCO2, pO2, HCO3, CK, KC-MB, Troponin I, Transfusion (RBC, plasma, Platelet), Influenza, RSV |
Medication Administrations | Tocilizumab, Remdesivir, Sarilumab, Hydroxychloroquine, Anakinra, Azithromycin, Rivaroxiban, Apixaban, Enoxaparin, tPA, Heparin, Etanercept, Nitric oxide, Dopamine, Vasopressin, Norepinephrine, Epinephrine, Milrinone, Dobutamine, Phenylephrine, Methylprednisolone, Prednisone, Dexamethasone, Betamethasone, Famotidine |
Click here to view slides with more information on this COVID-19 de-identified data set
Download COVID-19 EHR Data Set
- Download de-identified COVID-19 research data – COVID-19 vaccine immunization file added 2/9/21
- View raw Epic Caboodle SQL COVID-19 queries for research reproducibility
Researchers seeking a custom COVID-19 data set, requests are subject to a fee of $180/hour. Submit a request for custom data here.