{"id":4806,"date":"2022-05-11T23:34:36","date_gmt":"2022-05-12T03:34:36","guid":{"rendered":"https:\/\/labs.icahn.mssm.edu\/minervalab\/?p=4806"},"modified":"2022-05-11T23:34:39","modified_gmt":"2022-05-12T03:34:39","slug":"tcga","status":"publish","type":"post","link":"https:\/\/labs.icahn.mssm.edu\/minervalab\/tcga\/","title":{"rendered":"tcga"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_fullwidth_menu menu_id=&#8221;14&#8243; menu_style=&#8221;centered&#8221; fullwidth_menu=&#8221;on&#8221; active_link_color=&#8221;#221f72&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; menu_font=&#8221;|600|||||||&#8221; menu_text_color=&#8221;#FFFFFF&#8221; menu_font_size=&#8221;16px&#8221; background_color=&#8221;#00aeef&#8221; background_layout=&#8221;dark&#8221; sticky_position=&#8221;top&#8221;][\/et_pb_fullwidth_menu][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243;
admin_label=&#8221;section&#8221; _builder_version=&#8221;4.9.0&#8243; custom_padding=&#8221;0px||||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;|auto|-12px|auto||&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;TCGA-title name&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;]<\/p>\n<h1><strong><span style=\"color: #000080\">TCGA &#8211; The Cancer Genome Atlas Program<\/span><\/strong><\/h1>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_image src=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2022\/05\/tcga-png.png&#8221; alt=&#8221;tcga&#8221; title_text=&#8221;tcga-png&#8221; admin_label=&#8221;The main Image&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;-37px|||||&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;|auto|35px|auto||&#8221; custom_padding=&#8221;28px|||||&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221; 
header_2_text_color=&#8221;#221f72&#8243;]<\/p>\n<p><span style=\"font-size: large\"><a href=\"https:\/\/www.cancer.gov\/about-nci\/organization\/ccg\/research\/structural-genomics\/tcga\">The Cancer Genome Atlas (TCGA)<\/a> is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. The program is a joint effort between the National Cancer Institute and the National Human Genome Research Institute that began in 2006.<\/span><\/p>\n<p><span style=\"font-size: large\">Currently, two versions are hosted on the Data Ark: version 31.0 and version 32.0. The gene model used as a reference across TCGA was updated from GENCODE 22 in version 31 to GENCODE 36 in version 32; both annotations are built on GRCh38\/hg38. To learn more about how the data sets differ between versions, see the data release notes <\/span><a href=\"https:\/\/docs.gdc.cancer.gov\/Data\/Release_Notes\/Data_Release_Notes\/#data-release-320\"><span style=\"font-size: large\">here<\/span><\/a><span style=\"font-size: large\">.<\/span><\/p>\n<p><span style=\"font-size: large\">All the TCGA data sets downloaded belong to the \u201copen-access\u201d category and were obtained from the <a href=\"https:\/\/portal.gdc.cancer.gov\/\">Genomic Data Commons Data Portal.<\/a><\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_text_color=&#8221;#221f72&#8243;]<\/p>\n<h1>TCGA Processed Data Sets<\/h1>\n<p>The TCGA RNA-seq counts files have been processed into 33 folders.<\/p>\n<p>&nbsp;<\/p>\n<p>[\/et_pb_text][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row
_builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221;]<\/p>\n<p><span style=\"font-size: large\"><strong>NO\u00a0DUA form<\/strong> is required to use this data. You can access the data at the following path on Minerva: <\/span><span style=\"font-size: large\"><span class=\"s1\" style=\"color: #00aeef\">\/sc\/arion\/projects\/data-ark\/Public_Unrestricted\/gnomAD<\/span><\/span><span style=\"font-size: large\">, or you can run <strong>$ module load dataark<\/strong> to see the path variables.<\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_divider _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_divider][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_2,1_2&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;-53px|auto||auto||&#8221;][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;]<\/p>\n<h1>Data Ark Data Sets<\/h1>\n<p><strong>Public data sets (unrestricted)<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/1000-genomes\/\">1,000 Genomes Project<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/gtex\/\">GTEx<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/gwas-summary-statistics\/\">GWAS Summary Stats<\/a><\/li>\n<li><span style=\"color: #d80b8c\">gnomAD<\/span><\/li>\n<\/ul>\n<p><strong>Public data sets (restricted)<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/uk-biobank\/\">UK Biobank<\/a><\/li>\n<\/ul>\n<p><strong>Mount Sinai generated data (unrestricted)<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/stop-covid-nyc-cohort\/\">STOP COVID NYC Cohort<\/a><\/li>\n<\/ul>\n<p><strong>Mount Sinai generated data (restricted)<\/strong><\/p>\n<ul>\n<li><a
href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/mount-sinai-data-warehouse-covid-19-electronic-health-record-ehr-data-set\/\">MSDW COVID-19 EHR Data Set<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/mscic-covid-19-biobank\/\">Mount Sinai COVID-19 Biobank<\/a><\/li>\n<\/ul>\n<p><strong>Data access\u00a0<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Request form<\/a><\/li>\n<\/ul>\n<p>[\/et_pb_text][\/et_pb_column][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>TCGA &#8211; The Cancer Genome Atlas Program. The Cancer Genome Atlas (TCGA) is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types.
The program is a joint effort between National Cancer Institute and [&hellip;]<\/p>\n","protected":false},"author":530,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-4806","post","type-post","status-publish","format-standard","hentry","category-research-services"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/posts\/4806","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/users\/530"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/comments?post=4806"}],"version-history":[{"count":3,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/posts\/4806\/revisions"}],"predecessor-version":[{"id":4813,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/posts\/4806\/revisions\/4813"}],"wp:attachment":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/media?parent=4806"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/categories?post=4806"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/tags?post=4806"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}