{"id":1528,"date":"2021-01-19T11:46:00","date_gmt":"2021-01-19T16:46:00","guid":{"rendered":"https:\/\/labs.icahn.mssm.edu\/minervalab\/?page_id=1528"},"modified":"2024-01-09T11:40:19","modified_gmt":"2024-01-09T16:40:19","slug":"uk-biobank","status":"publish","type":"page","link":"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/uk-biobank\/","title":{"rendered":"UK Biobank"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_fullwidth_menu menu_id=&#8221;14&#8243; menu_style=&#8221;centered&#8221; fullwidth_menu=&#8221;on&#8221; active_link_color=&#8221;#d80b8c&#8221; dropdown_menu_line_color=&#8221;#221f72&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; menu_font=&#8221;|600|||||||&#8221; menu_text_color=&#8221;#FFFFFF&#8221; menu_font_size=&#8221;16px&#8221; background_color=&#8221;#221f72&#8243; background_layout=&#8221;dark&#8221; sticky_position=&#8221;top&#8221;][\/et_pb_fullwidth_menu][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||0px||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||0px||false|false&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;breadcrumb&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;]<\/p>\n<p><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/scientific-computing-and-data\/\">Scientific Computing and Data<\/a> \/ <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/rds\/\">Research Data Services<\/a> \/\u00a0<a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/\">Data Ark: Data Commons<\/a> \/ UK Biobank<\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; admin_label=&#8221;section&#8221; _builder_version=&#8221;4.9.0&#8243; custom_padding=&#8221;0px||||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Header&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font_size=&#8221;36px&#8221; custom_margin=&#8221;-18px|||||&#8221;]<\/p>\n<h1><span style=\"color: #000080\"><strong>UK Biobank<\/strong><\/span><\/h1>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;-40px|auto||auto||&#8221; custom_padding=&#8221;48px|||||&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_image src=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2021\/01\/da3.png&#8221; title_text=&#8221;da3&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Body text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221;]<\/p>\n<h2>Overview<\/h2>\n<p>The UK Biobank is the largest widely available genetic epidemiological data set in the world. The data set includes genotype, whole exome and whole genome sequencing data on 500,000 individuals, linked to a rich set of thousands of self-reported traits, hospital and primary care records, biomarker measures, mental health information and imaging data. Participants aged 40-69 years were recruited during 2006-2010 across 22 recruitment centres in the United Kingdom and have been followed up multiple times since collecting a huge range of information on their traits, biomarkers and diseases.[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_image src=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2021\/01\/da4.png&#8221; title_text=&#8221;da4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Body text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221;]<\/p>\n<h2>Access<\/h2>\n<blockquote><p>To use these data you must submit a project proposal to the UK Biobank (see <a href=\"https:\/\/www.ukbiobank.ac.uk\/enable-your-research\/apply-for-access\">here<\/a>) describing your planned research and receive its approval, or if you are collaborating with one of the labs at Sinai that already have an approved application (see list <a href=\"https:\/\/docs.google.com\/document\/d\/1JPhQysIMEKyg8W4jrRtT0gTo5o8ZRvuDv6WApAN2IXc\/edit?usp=sharing\">here<\/a>) then you could potentially be added to their application (via discussion with PI\/delegate) if your project is strongly aligned to theirs. However, we highly recommend making your own application, in which you can specify all research that you plan to conduct with the data and control who has access to it.<\/p><\/blockquote>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][et_pb_row column_structure=&#8221;1_5,3_5,1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][et_pb_column type=&#8221;3_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_image src=&#8221;https:\/\/www.ukbiobank.ac.uk\/media\/22vbx5sa\/access-fee-sept-2022.jpg?format=webp&amp;quality=80&#8243; title_text=&#8221;da4&#8243; force_fullwidth=&#8221;on&#8221; admin_label=&#8221;Cost of UKB Access&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; module_alignment=&#8221;center&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;1_5&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_column][\/et_pb_row][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;Body text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; text_line_height=&#8221;1.5em&#8221; hover_enabled=&#8221;0&#8243; sticky_enabled=&#8221;0&#8243;]<\/p>\n<h2><\/h2>\n<blockquote><p>The three Tiers of fees are based broadly on the size of the respective datasets.<\/p><\/blockquote>\n<p>While the phenotype data for approved applications can only be accessed by investigators listed on the corresponding application, we host and manage all of the genetic data on the Data Ark (genotype data in <em>PLINK<\/em> format, imputed data in <em>BGEN<\/em> format), which can be used by any investigator on an approved UK Biobank application.<\/p>\n<p>The data access process, as well as details on the range of data types and how to perform UK Biobank analyses, have been described in a <a href=\"https:\/\/choishingwan.gitlab.io\/ukb-administration\/\">UK Biobank tutorial<\/a> produced by Judit Garcia-Gonzalez (<a href=\"mailto:judit.garciagonzalez@mssm.edu\">judit.garciagonzalez@mssm.edu<\/a>) and Sam (Shing Wan) Choi (<a href=\"mailto:shingwan.choi@mssm.edu\">shingwan.choi@mssm.edu<\/a>) from the O\u2019Reilly lab, which we highly recommend reading. This tutorial includes SQL scripts (written by Sam Choi) that will make accessing ICD-10 coded disease data from the biobank substantially easier. For more details on the UK Biobank, see the main website <a href=\"https:\/\/www.ukbiobank.ac.uk\/\">here<\/a>.<\/p>\n<p>To use this data through Data Ark, you must read, agree and sign the <a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Data Use Agreement<\/a> (you must be logged in through the Mount Sinai campus network or secure remote VPN).<\/p>\n<h2>More Information<\/h2>\n<p>The UK Biobank genetic data were uploaded to the Data Ark by Shing Wan Choi (<a href=\"mailto:shingwan.choi@mssm.edu\">shingwan.choi@mssm.edu<\/a>), on 11\/1\/20, and will be updated as new genetic data are provided by the UK Biobank.<\/p>\n<h2>Data Ark Data Sets<\/h2>\n<p>Please visit the <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/data-ark-data-sets\/\">Data Ark Data Set<\/a> webpage to explore other data sets.[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scientific Computing and Data \/ Research Data Services \/\u00a0Data Ark: Data Commons \/ UK BiobankUK BiobankOverview The UK Biobank is the largest widely available genetic epidemiological data set in the world. The data set includes genotype, whole exome and whole genome sequencing data on 500,000 individuals, linked to a rich set of thousands of self-reported [&hellip;]<\/p>\n","protected":false},"author":396,"featured_media":0,"parent":1321,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"<img class=\"aligncenter size-full wp-image-1514\" src=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2021\/01\/da3.png\" alt=\"\" width=\"977\" height=\"350\" \/>\r\n\r\nThe UK Biobank is the largest widely available genetic epidemiological data set in the world. The data set includes genotype, whole exome and whole genome sequencing data on 500,000 individuals, linked to a rich set of thousands of self-reported traits, hospital and primary care records, biomarker measures, mental health information and imaging data. Participants aged 40-69 years were recruited during 2006-2010 across 22 recruitment centres in the United Kingdom and have been followed up multiple times since collecting a huge range of information on their traits, biomarkers and diseases.\r\n\r\n<img class=\"aligncenter size-full wp-image-1515\" src=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2021\/01\/da4.png\" alt=\"\" width=\"940\" height=\"334\" \/>\r\n\r\n\u00a0\r\n\r\nTo use these data you must submit a project proposal to the UK Biobank (see <a href=\"https:\/\/www.ukbiobank.ac.uk\/enable-your-research\/apply-for-access\">here<\/a>) describing your planned research and receive its approval, or if you are collaborating with one of the labs at Sinai that already have an approved application (see list <a href=\"https:\/\/www.dropbox.com\/s\/jzlicps619hc2vz\/List_UKB_aplication_DA.pdf?dl=0\">here<\/a>) then you could potentially be added to their application (via discussion with PI\/delegate) if your project is strongly aligned to theirs. However, we highly recommend making your own application, in which you can specify all research that you plan to conduct with the data and control who has access to it.\r\n\r\nWhile the phenotype data for approved applications can only be accessed by investigators listed on the corresponding application, we host and manage all of the genetic data on the Data Ark (genotype\/exome data in <em>PLINK<\/em> format, imputed data in <em>BGEN<\/em> format), which can be used by any investigator on an approved UK Biobank application.\r\n\r\nThe data access process, as well as details on the range of data types and how to perform UK Biobank analyses, have been described in a <a href=\"https:\/\/choishingwan.gitlab.io\/ukb-administration\/\">UK Biobank tutorial<\/a> produced by Judit Garcia-Gonzalez (<a href=\"mailto:judit.garciagonzalez@mssm.edu\">judit.garciagonzalez@mssm.edu<\/a>) and Sam (Shing Wan) Choi (<a href=\"mailto:shingwan.choi@mssm.edu\">shingwan.choi@mssm.edu<\/a>) from the O\u2019Reilly lab, which we highly recommend reading. This tutorial includes SQL scripts (written by Sam Choi) that will make accessing ICD-10 coded disease data from the biobank substantially easier. For more details on the UK Biobank, see the main website <a href=\"https:\/\/www.ukbiobank.ac.uk\/\">here<\/a>.\r\n\r\nThe UK Biobank genetic data were uploaded to the Data Ark by Shing Wan Choi (<a href=\"mailto:shingwan.choi@mssm.edu\">shingwan.choi@mssm.edu<\/a>), on 11\/1\/20, and will be updated as new genetic data are provided by the UK Biobank.\r\n\r\nTo use these data, you must read, agree and sign the <a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\"><strong><em>Data Use Agreement here<\/em><\/strong><\/a>.","_et_gb_content_width":"","footnotes":""},"class_list":["post-1528","page","type-page","status-publish","hentry"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1528","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/users\/396"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/comments?post=1528"}],"version-history":[{"count":40,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1528\/revisions"}],"predecessor-version":[{"id":7825,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1528\/revisions\/7825"}],"up":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1321"}],"wp:attachment":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/media?parent=1528"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}