{"id":4645,"date":"2022-05-04T12:54:04","date_gmt":"2022-05-04T16:54:04","guid":{"rendered":"https:\/\/labs.icahn.mssm.edu\/minervalab\/?page_id=4645"},"modified":"2024-01-10T17:57:17","modified_gmt":"2024-01-10T22:57:17","slug":"tcga","status":"publish","type":"page","link":"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/tcga\/","title":{"rendered":"TCGA"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_fullwidth_menu menu_id=&#8221;14&#8243; menu_style=&#8221;centered&#8221; fullwidth_menu=&#8221;on&#8221; active_link_color=&#8221;#d80b8c&#8221; dropdown_menu_bg_color=&#8221;#221f72&#8243; dropdown_menu_line_color=&#8221;#221f72&#8243; dropdown_menu_active_link_color=&#8221;#d80b8c&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; menu_font=&#8221;|600|||||||&#8221; menu_text_color=&#8221;#FFFFFF&#8221; menu_font_size=&#8221;16px&#8221; background_color=&#8221;#221f72&#8243; background_layout=&#8221;dark&#8221; sticky_position=&#8221;top&#8221;][\/et_pb_fullwidth_menu][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||0px||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||0px||false|false&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;PATH&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;]<\/p>\n<p><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/scientific-computing-and-data\/\">Scientific Computing and Data<\/a> \/ <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/rds\/\">Research Data Services<\/a> \/\u00a0<a 
href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/\">Data Ark: Data Commons<\/a> \/ TCGA<\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; admin_label=&#8221;section&#8221; _builder_version=&#8221;4.9.0&#8243; custom_padding=&#8221;0px||||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;|auto|-12px|auto||&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;TCGA-title name&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font_size=&#8221;36px&#8221;]<\/p>\n<h1><strong><span style=\"color: #000080\">TCGA &#8211; The Cancer Genome Atlas Program<\/span><\/strong><\/h1>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_image src=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2022\/05\/tcga-png.png&#8221; alt=&#8221;tcga&#8221; title_text=&#8221;tcga-png&#8221; admin_label=&#8221;The main Image&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;-37px|||||&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;|auto|35px|auto||&#8221; custom_padding=&#8221;28px|||||&#8221;][et_pb_column type=&#8221;4_4&#8243; 
_builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Main_content&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243; header_2_font_size=&#8221;24px&#8221;]<\/p>\n<h2>Overview<\/h2>\n<p><a href=\"https:\/\/www.cancer.gov\/about-nci\/organization\/ccg\/research\/structural-genomics\/tcga\">The Cancer Genome Atlas (TCGA)<\/a> is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. The program, a joint effort between the National Cancer Institute and the National Human Genome Research Institute, began in 2006.<\/p>\n<p>Currently, two versions are hosted on the Data Ark: version 31.0 and version 32.0. The gene model used as a reference across TCGA was updated from GENCODE 22 in version 31 to GENCODE 36 in version 32; both are based on GRCh38\/hg38. To learn more about the differences between versions, see the <a href=\"https:\/\/docs.gdc.cancer.gov\/Data\/Release_Notes\/Data_Release_Notes\/#data-release-320\">data release notes<\/a>.<\/p>\n<p>All the TCGA data sets belong to the \u201copen-access\u201d category and were obtained from the <a href=\"https:\/\/portal.gdc.cancer.gov\/\">Genomic Data Commons Data Portal<\/a>. 
The TCGA folder on the Minerva supercomputer hosts the biospecimen, clinical, RNA-seq count, and whole-exome sequencing (WXS) Mutation Annotation Format (MAF) files, as well as the TCGA data sets from <a href=\"https:\/\/www.cbioportal.org\/datasets\">cBioPortal<\/a>.<\/p>\n<h4>TCGA Processed Data Sets<\/h4>\n<p>The TCGA data sets were processed and uploaded to Data Ark by Dr. Deniz Demircioglu (<a href=\"mailto:deniz.demircioglu@mssm.edu\">deniz.demircioglu@mssm.edu<\/a>), who combined data from the biospecimen and clinical folders and consolidated all the RNA-seq count files (over 11,000 patients) into 33 outcome folders, one per TCGA study (see the table below). For more information, see the <a href=\"https:\/\/gdc.cancer.gov\/resources-tcga-users\/tcga-code-tables\/tcga-study-abbreviations\">TCGA Study Abbreviations<\/a>.<\/p>\n<table style=\"width: 975px;height: 277px\">\n<tbody>\n<tr style=\"background-color: #00aeef\">\n<td style=\"width: 55px\"><span style=\"color: #ffffff\"><strong>Abbrev.<\/strong><\/span><\/td>\n<td style=\"width: 343.986px\"><span style=\"color: #ffffff\"><strong>Study Name<\/strong><\/span><\/td>\n<td style=\"width: 11.0139px\"><span style=\"color: #ffffff\"><strong>Abbrev.<\/strong><\/span><\/td>\n<td style=\"width: 263px\"><span style=\"color: #ffffff\"><strong>Study Name<\/strong><\/span><\/td>\n<td style=\"width: 46px\"><span style=\"color: #ffffff\"><strong>Abbrev.<\/strong><\/span><\/td>\n<td style=\"width: 331px\"><span style=\"color: #ffffff\"><strong>Study Name<\/strong><\/span><\/td>\n<\/tr>\n<tr style=\"background-color: #ffffff\">\n<td style=\"width: 55px\">ACC<\/td>\n<td style=\"width: 343.986px\">Adrenocortical carcinoma<\/td>\n<td style=\"width: 11.0139px\">BLCA<\/td>\n<td style=\"width: 263px\">Bladder Urothelial Carcinoma<\/td>\n<td style=\"width: 46px\">BRCA<\/td>\n<td style=\"width: 331px\">Breast invasive carcinoma<\/td>\n<\/tr>\n<tr style=\"background-color: #f5f5f5\">\n<td style=\"width: 55px\">CESC<\/td>\n<td style=\"width: 
343.986px\">Cervical squamous cell carcinoma and endocervical adenocarcinoma<\/td>\n<td style=\"width: 11.0139px\">CHOL<\/td>\n<td style=\"width: 263px\">Cholangiocarcinoma<\/td>\n<td style=\"width: 46px\">COAD<\/td>\n<td style=\"width: 331px\">Colon adenocarcinoma<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55px\">DLBC<\/td>\n<td style=\"width: 343.986px\">Lymphoid Neoplasm Diffuse Large B-cell Lymphoma<\/td>\n<td style=\"width: 11.0139px\">ESCA<\/td>\n<td style=\"width: 263px\">Esophageal carcinoma<\/td>\n<td style=\"width: 46px\">GBM<\/td>\n<td style=\"width: 331px\">Glioblastoma multiforme<\/td>\n<\/tr>\n<tr style=\"background-color: #f5f5f5\">\n<td style=\"width: 55px\">HNSC<\/td>\n<td style=\"width: 343.986px\">Head and Neck squamous cell carcinoma<\/td>\n<td style=\"width: 11.0139px\">KICH<\/td>\n<td style=\"width: 263px\">Kidney Chromophobe<\/td>\n<td style=\"width: 46px\">KIRC<\/td>\n<td style=\"width: 331px\">Kidney renal clear cell carcinoma<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55px\">KIRP<\/td>\n<td style=\"width: 343.986px\">Kidney renal papillary cell carcinoma<\/td>\n<td style=\"width: 11.0139px\">LAML<\/td>\n<td style=\"width: 263px\">Acute Myeloid Leukemia<\/td>\n<td style=\"width: 46px\">LGG<\/td>\n<td style=\"width: 331px\">Brain Lower Grade Glioma<\/td>\n<\/tr>\n<tr style=\"background-color: #f5f5f5\">\n<td style=\"width: 55px\">LIHC<\/td>\n<td style=\"width: 343.986px\">Liver hepatocellular carcinoma<\/td>\n<td style=\"width: 11.0139px\">LUAD<\/td>\n<td style=\"width: 263px\">Lung adenocarcinoma<\/td>\n<td style=\"width: 46px\">LUSC<\/td>\n<td style=\"width: 331px\">Lung squamous cell carcinoma<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55px\">MESO<\/td>\n<td style=\"width: 343.986px\">Mesothelioma<\/td>\n<td style=\"width: 11.0139px\">OV<\/td>\n<td style=\"width: 263px\">Ovarian serous cystadenocarcinoma<\/td>\n<td style=\"width: 46px\">PAAD<\/td>\n<td style=\"width: 331px\">Pancreatic adenocarcinoma<\/td>\n<\/tr>\n<tr style=\"background-color: 
#f5f5f5\">\n<td style=\"width: 55px\">PCPG<\/td>\n<td style=\"width: 343.986px\">Pheochromocytoma and Paraganglioma<\/td>\n<td style=\"width: 11.0139px\">PRAD<\/td>\n<td style=\"width: 263px\">Prostate adenocarcinoma<\/td>\n<td style=\"width: 46px\">READ<\/td>\n<td style=\"width: 331px\">Rectum adenocarcinoma<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55px\">SARC<\/td>\n<td style=\"width: 343.986px\">Sarcoma<\/td>\n<td style=\"width: 11.0139px\">SKCM<\/td>\n<td style=\"width: 263px\">Skin Cutaneous Melanoma<\/td>\n<td style=\"width: 46px\">STAD<\/td>\n<td style=\"width: 331px\">Stomach adenocarcinoma<\/td>\n<\/tr>\n<tr style=\"background-color: #f5f5f5\">\n<td style=\"width: 55px\">TGCT<\/td>\n<td style=\"width: 343.986px\">Testicular Germ Cell Tumors<\/td>\n<td style=\"width: 11.0139px\">THCA<\/td>\n<td style=\"width: 263px\">Thyroid carcinoma<\/td>\n<td style=\"width: 46px\">THYM<\/td>\n<td style=\"width: 331px\">Thymoma<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 55px\">UCEC<\/td>\n<td style=\"width: 343.986px\">Uterine Corpus Endometrial Carcinoma<\/td>\n<td style=\"width: 11.0139px\">UCS<\/td>\n<td style=\"width: 263px\">Uterine Carcinosarcoma<\/td>\n<td style=\"width: 46px\">UVM<\/td>\n<td style=\"width: 331px\">Uveal Melanoma<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>Inside each outcome folder, you will see the following files:<\/p>\n<table style=\"width: 315px\">\n<tbody>\n<tr>\n<td style=\"width: 121px\">aliquot.tsv<\/td>\n<td style=\"width: 81px\">count.tsv<\/td>\n<td style=\"width: 141.606px\">fpkm.tsv<\/td>\n<td style=\"width: 128.394px\">sample.tsv<\/td>\n<\/tr>\n<tr style=\"background-color: #f5f5f5\">\n<td style=\"width: 121px\">analyte.tsv<\/td>\n<td style=\"width: 81px\">exposure.tsv<\/td>\n<td style=\"width: 141.606px\">fpkm_uq.tsv<\/td>\n<td style=\"width: 128.394px\">slide.tsv<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 121px\">clinical.tsv<\/td>\n<td style=\"width: 81px\">family_history.tsv<\/td>\n<td style=\"width: 
141.606px\">portion.tsv<\/td>\n<td style=\"width: 128.394px\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>[\/et_pb_text][et_pb_text admin_label=&#8221;Access&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221; header_2_font_size=&#8221;24px&#8221; hover_enabled=&#8221;0&#8243; sticky_enabled=&#8221;0&#8243;]<\/p>\n<h2>Access<\/h2>\n<p>Effective January 22, 2024, you must read, agree to, and sign the <a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Data Use Agreement<\/a> (you must be connected through the Mount Sinai campus network or secure remote VPN). Access is granted within 24 hours. On Minerva, run <strong>$ module load dataark<\/strong> to see the path variables.<\/p>\n<p>By using these data, you agree to acknowledge <a href=\"https:\/\/bings.mssm.edu\/\">BiNGS<\/a>, the Tisch Cancer Institute Bioinformatics Core for Next-Generation Sequencing, and the Data Ark team in all oral and written presentations, grant submissions, awards, and publications resulting from any analyses of the data sets.<\/p>\n<h2>Data Ark Data Sets<\/h2>\n<p>Please visit the <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/data-ark-data-sets\/\">Data Ark Data Sets<\/a> webpage to explore other data sets.<\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scientific Computing and Data \/ Research Data Services \/\u00a0Data Ark: Data Commons \/ TCGATCGA &#8211; The Cancer Genome Atlas ProgramOverview The Cancer Genome Atlas (TCGA) is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. 
The program is a joint effort between National Cancer [&hellip;]<\/p>\n","protected":false},"author":415,"featured_media":0,"parent":1321,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"class_list":["post-4645","page","type-page","status-publish","hentry"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/4645","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/users\/415"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/comments?post=4645"}],"version-history":[{"count":47,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/4645\/revisions"}],"predecessor-version":[{"id":7869,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/4645\/revisions\/7869"}],"up":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1321"}],"wp:attachment":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/media?parent=4645"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}