Scientific Computing and Data / Research Data ServicesData Ark: Data Commons / Kidney Precision Medicine Project (KPMP)

Kidney Precision Medicine Project (KPMP)

Overview

Introduction

The Kidney Precision Medicine Project (KPMP) is a prospective cohort study, whose goal is to use deep molecular phenotypes of kidney biopsies, along with longitudinally collected clinical phenotypic data, in order to develop new disease ontologies, classification systems, and treatments for acute kidney injury (AKI) and chronic kidney disease (CKD).

For each participant, kidney tissue is obtained for molecular phenotyping and clinical diagnosis. In addition to the kidney biopsy tissue, the study collects baseline (time of biopsy) and longitudinal biospecimens (including urine, plasma, serum, DNA, and stool) and demographic, clinical, and laboratory data.

Participant Population

The KPMP is focusing on participants with acute and chronic kidney disease, who account for a large portion of the public health concern as reported by research and federal data.

CKD

  • High priority populations include CKD in the setting of diabetes (diabetic kidney disease, DKD) and hypertension-associated CKD (H-CKD).

DM-R

  • A special population of people with long-standing type 1 diabetes (more than 25 years) who remain free of clinically-evident DKD (i.e. diabetes mellitus-resilient (DM-R) individuals also known as “Resistors”) will also be included. Study of the DM-R population using KPMP protocols offers a unique opportunity to identify protective factors against complications of diabetes mellitus.

AKI

  • The focus will be on acute intrinsic non-glomerular disease, primarily on acute tubular necrosis (ATN).

Inclusion / Exclusion Criteria

Inclusion:

  • Diabetic Kidney Disease (Type 1 or 2)
  • Hypertensive Kidney Disease
  • Acute Kidney Injury
  • Patient has had diabetes for many years but no clinical indications of CKD

Exclusion:

  • Age (under 18)
  • Glomerular disease, kidney transplant, malignancy, pregnancy
  • Increased biopsy complication risk

For more specific details regarding inclusion and exclusion criteria for our various study populations, see the full clinical protocol on the KPMP website.

Cite KPMP in your publications

The following citation must be used when citing KPMP data. KPMP follows the AMA standard for its citations.

The results here are in whole or part based upon data generated by the Kidney Precision Medicine Project. Accessed Month Day, Year. https://www.kpmp.org.

The Kidney Precision Medicine Project (KPMP) is supported by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) through the following grants: U01DK133081, U01DK133091, U01DK133092, U01DK133093, U01DK133095, U01DK133097, U01DK114866, U01DK114908, U01DK133090, U01DK133113, U01DK133766, U01DK133768, U01DK114907, U01DK114920, U01DK114923, U01DK114933, U24DK114886, UH3DK114926, UH3DK114861, UH3DK114915, and UH3DK114937. We gratefully acknowledge the essential contributions of our patient participants and the support of the American public through their tax dollars.

NOTE: Remember to replace “Accessed Month Day, Year” with the correct specific details.

 

What Is Hosted Under Data Ark?

Files in the Open-access Data Repository by -omics type (last updated May 8, 2026).

The datasets available in the open-access data repository are a combination of raw and processed data from KPMP participant biopsies and reference tissue samples.

OMICS TYPE REFERENCE CKD AKI DM-R ALL
10x Multiome 73 96 64 24 257
3D Tissue Imaging and Cytometry 153 504 379 135 1171
ATAC-Seq 73 96 64 24 257
Biomarkers 13 18 18 18 18
Bulk Total/mRNA 39 29 68
Clinical Study Data 1 1 1 1 1
CODEX 75 319 212 96 702
CUT&RUN 128 95 30 13 266
DNA Methyl-seq 52 4 60
Imaging Mass Cytometry 32 56 48 136
Light Microscopic Whole Slide Images 110 4427 968 583 6088
Multimodal Imaging Mass Spectrometry 24 48 36 8 116
Pathology Descriptor Scoring 1 1 1 1 1
Regional Proteomics 19 53 49 119
Regional Transcriptomics 2 16 23 39
Segmentation Masks & Pathomics Vectors 5 263 78 15 361
Single-cell RNA-Seq 147 388 78 51 656
Single-nucleus RNA-Seq 208 176 100 30 504
Spatial Lipidomics 14 124 32 14 184
Spatial Metabolomics 24 162 44 20 250
Spatial N-Glycomics 16 36 24 18 94
Spatial Transcriptomics 131 478 310 120 1039

 

TOTAL FILES: 12,387

Access

To use this data, you must read, agree and sign the Data Use Agreement (you must be logged in through the Mount Sinai campus network or secure remote VPN). On Minerva, you can load module $ module load dataark to see the path variables.

 

Data Ark Data Sets

Please visit the Data Ark Data Set webpage to explore other data sets.