{"id":3699,"date":"2021-11-08T10:32:59","date_gmt":"2021-11-08T15:32:59","guid":{"rendered":"https:\/\/labs.icahn.mssm.edu\/minervalab\/?page_id=3699"},"modified":"2025-05-14T14:01:29","modified_gmt":"2025-05-14T18:01:29","slug":"about-data-ark","status":"publish","type":"page","link":"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/about-data-ark\/","title":{"rendered":"About Data Ark"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_fullwidth_menu menu_id=&#8221;14&#8243; menu_style=&#8221;centered&#8221; active_link_color=&#8221;#221f72&#8243; dropdown_menu_line_color=&#8221;#221f72&#8243; dropdown_menu_active_link_color=&#8221;#d80b8c&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; menu_font=&#8221;|600|||||||&#8221; menu_font_size=&#8221;16px&#8221; background_color=&#8221;#221f72&#8243; background_layout=&#8221;dark&#8221; sticky_position=&#8221;top&#8221;][\/et_pb_fullwidth_menu][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||0px||false|false&#8221;][et_pb_row _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;||0px||false|false&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243;]<\/p>\n<p><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/scientific-computing-and-data\/\">Scientific Computing and Data<\/a> \/ <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/rds\/\">Research Data Services<\/a> \/ <a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/\">Data Ark: Data Commons<\/a> \/ About Data Ark<\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; specialty=&#8221;on&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||||false|false&#8221;][et_pb_column type=&#8221;1_4&#8243; _builder_version=&#8221;3.25&#8243; custom_padding=&#8221;|||&#8221; custom_padding__hover=&#8221;|||&#8221;][et_pb_image src=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2021\/02\/Data_Ark_Final-1024&#215;640.jpg&#8221; title_text=&#8221;Data_Ark_Final&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_image][\/et_pb_column][et_pb_column type=&#8221;3_4&#8243; specialty_columns=&#8221;3&#8243; _builder_version=&#8221;3.25&#8243; custom_padding=&#8221;|||&#8221; custom_padding__hover=&#8221;|||&#8221;][et_pb_row_inner _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column_inner saved_specialty_column_type=&#8221;3_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_text admin_label=&#8221;About Data Ark Text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243; header_2_text_color=&#8221;#00aeef&#8221; header_2_font_size=&#8221;24px&#8221; hover_enabled=&#8221;0&#8243; sticky_enabled=&#8221;0&#8243;]<\/p>\n<h1>About Data Ark<\/h1>\n<p>The Data Ark team downloads, organizes and performs quality assurance and quality control on the data. The team also manages the data access process, answers questions on the data, and updates to the latest versions of the data sets. The Data Ark is located on Minerva at \/sc\/arion\/projects\/data-ark\/. This Mount Sinai data commons is guided by the FAIR principles [1]: making data more\u00a0<em>findable<\/em>,<em>\u00a0accessible<\/em>,<em>\u00a0interoperable and reusable<\/em>. Data Ark includes both public (restricted and unrestricted) and Sinai-generated data sets.<\/p>\n<p>The overarching goal of the Data Ark is to ensure that research data at Mount Sinai are managed, processed and combined in a way that optimizes the power, pace and relevance of our science.<\/p>\n<ul>\n<li><strong>Power<\/strong>: Scientists typically use only a tiny fraction of available data<\/li>\n<li><strong>Pace<\/strong>: Users will have rapid access to huge, powerful research data<\/li>\n<li><strong>Relevance<\/strong>: Our diverse patient population is ideal for testing the generalizability of our results<\/li>\n<\/ul>\n<p>Data Ark is an initiative led by Associate Professor Paul O\u2019Reilly and Dean for Scientific Computing and Data Patricia Kovatch, and supported by the Department of Genetics and Genomic Sciences and Scientific Computing. An advisory board has been convened to provide guidance and to help Data Ark become sustainable over time.<\/p>\n<p>We are supported by grant UL1TR004419 from the National Center for Advancing Translational Sciences, National Institutes of Health.<\/p>\n<p>&nbsp;<\/p>\n<h1>Access Data Ark<\/h1>\n<p>Effective from January 22, 2024, to access public, Mount Sinai-generated and restricted datasets, you must read, agree and sign the <a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Data Use Agreement <\/a>(you must be logged in through the Mount Sinai campus network or secure remote VPN). Access is granted within 24 hours, and on Minerva, you can load module <strong>$ module load dataark <\/strong>to see the path variables.<\/p>\n<p>The <strong>Data Use Agreement <\/strong>is accessible only through the Mount Sinai campus network or secure remote VPN. <a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Click here for the Data Use Agreement<\/a> and choose the data set that you would like to access from the drop-down list. From here you can follow the link to view and agree to the specific Data Use Agreement. Users will need to login with your Sinai account and password and will be able to choose only one data set at a time.<\/p>\n<p><strong>For more information and for all inquiries relating to the Data Ark, please email: <a href=\"mailto:hpchelp@hpc.mssm.edu\">hpchelp@hpc.mssm.edu<\/a>, or join our Data Ark Slack channel at <a href=\"https:\/\/join.slack.com\/t\/data-ark\/signup\"><span style=\"text-decoration: underline\">https:\/\/join.slack.com\/t\/data-ark\/signup <\/span><\/a> and signup using your Mount Sinai credentials. You will be able to interact with the researchers and the Data Ark group right away!\u00a0<\/strong><\/p>\n<p>&nbsp;<\/p>\n<h1>Data Ark User Feedback<\/h1>\n<p>We have asked Data Ark users for feedback on features and availability of data sets and solicit recommendations for improvement over time. Here are some specific recommendations and comments from Data Ark users:<\/p>\n<ul>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2025\/05\/2024-Data_Ark_Survey_Results_FINAL.pdf\">2024 User Comments and Feedback<\/a><\/li>\n<li><a href=\"https:\/\/liuy22.u.hpc.mssm.edu\/DataArk_UserSurvey\/2023-Data-Ark-User-Survey-Comments-and-Responses.pdf\">2023 User Comments and Feedback<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/msdw\/wp-content\/uploads\/sites\/350\/2023\/03\/2022-Data-Ark-User-Survey-Comments-and-Response.pdf\">2022 User Comments and Feedback<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2022\/03\/2022-Data-Ark-Survey-Results.pdf\">2021 User Comments and Feedback<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h1>Data Ark Support Materials<\/h1>\n<p>Scientific Computing and Data hosts Data Ark Town Hall and training sessions that are open to current and prospective Data Ark users. Here are the session archives:<\/p>\n<ul>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/Introduction_to_Data_Ark_20250327.mp4\">Introduction to Data Ark &#8211; March 2025 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/msdw\/wp-content\/uploads\/sites\/350\/2025\/03\/DataArk_Training_Spring2025.pdf\">Introduction to Data Ark &#8211; March 2025 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_Training_Recording_2024-10-04.mp4\">Introduction to Data Ark &#8211; October 2024 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2024\/10\/Data-Ark-Training_100424.pdf\">Introduction to Data Ark &#8211; October 2024 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_Training_Recording_2024-04-24.mp4\">Introduction to Data Ark &#8211; Mount Sinai Data Commons &#8211; April 2024 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2024\/04\/DataArk_Training_Spring2024.pdf\">Introduction to Data Ark &#8211; Mount Sinai Data Commons &#8211; April 2024 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_TownHall_Recording_2023-10-24.mp4\">Data Ark Town Hall &#8211; October 2023 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2023\/10\/DataArk_TownHall_Fall2023.pdf\">Data Ark Town Hall &#8211; October 2023 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_TownHall_Recording_2023-05-03.mp4\">Data Ark Town Hall &#8211; May 2023 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/msdw\/wp-content\/uploads\/sites\/350\/2023\/05\/Data-Ark-Town-Hall-0503_2023.pptx.pdf\">Data Ark Town Hall &#8211; May 2023 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/msdw\/wp-content\/uploads\/sites\/350\/2022\/12\/Data-Ark-Townhall_120222.pdf\">Data Ark Town Hall &#8211; December 2022 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_TownHall_Recording_2022-12-02.mp4\">Data Ark Town Hall &#8211; December 2022 (Recording)<\/a><\/li>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-content\/uploads\/sites\/342\/2022\/05\/Data-Ark-Townhall_052522.pptx.pdf\">Data Ark Town Hall &#8211; May 2022 (PowerPoint Slides)<\/a><\/li>\n<li><a href=\"https:\/\/sc.u.hpc.mssm.edu\/DataArk\/DataArk_TownHall_Recording_2022-05-25.mp4\">Data Ark Town Hall &#8211; May 2022 (Recording)<\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h1>Data Sets<\/h1>\n<p>The Data Ark is located on Minerva and the number, type, and diversity of data sets on the Data Ark are increasing on an ongoing basis.<\/p>\n<p>[\/et_pb_text][et_pb_button button_url=&#8221;https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/data-ark-data-sets\/&#8221; button_text=&#8221;Click here for data sets&#8221; button_alignment=&#8221;center&#8221; admin_label=&#8221;Data sets button&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_button=&#8221;on&#8221; button_text_color=&#8221;#FFFFFF&#8221; button_bg_use_color_gradient=&#8221;on&#8221; button_bg_color_gradient_start=&#8221;#00aeef&#8221; button_bg_color_gradient_end=&#8221;#221f72&#8243; button_bg_color_gradient_direction=&#8221;255deg&#8221; button_border_radius=&#8221;26px&#8221; button_font=&#8221;|600||on|||||&#8221; button_use_icon=&#8221;off&#8221; custom_margin=&#8221;20px||20px||false|false&#8221; custom_padding=&#8221;15px|30px|15px|30px|false|false&#8221; hover_enabled=&#8221;0&#8243; sticky_enabled=&#8221;0&#8243;][\/et_pb_button][et_pb_text admin_label=&#8221;Onboarding Text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243; header_2_text_color=&#8221;#00aeef&#8221; header_2_font_size=&#8221;24px&#8221;]<\/p>\n<p>&nbsp;<\/p>\n<h1>Onboarding Data Ark Data Sets<\/h1>\n<p>PI&#8217;s must complete a <a href=\"https:\/\/redcap.mountsinai.org\/redcap\/surveys\/?s=PCNDC9HRCAF4XJJ3\">REDCap form<\/a> and name expected research groups. Approval process is regulated according to data set size:<\/p>\n<ul>\n<li>=&lt;1 TB: Data Ark operations team will approve<\/li>\n<li>&gt;1 TB: must be approved by the Data Ark Advisory Board<\/li>\n<\/ul>\n<p><strong>Data Retention period:<\/strong> The original data owner will receive usage reports every quarter and will be alerted when other researchers are not using their data sets. If usage is low, then the data sets will be removed from Data Ark. Usage is evaluated annually.<\/p>\n<p>To read more information about the Data Ark Onboarding Policy, including data retention and contacts, please click the downloadable &#8220;Data Ark Onboarding\/Offboarding Policy&#8221; PDF below.<\/p>\n<p>[\/et_pb_text][et_pb_button button_url=&#8221;https:\/\/liuy22.u.hpc.mssm.edu\/onboarding\/Data_Ark_Policy_0324.pdf&#8221; button_text=&#8221;Data Ark Onboarding\/Offboarding Policy (PDF)&#8221; button_alignment=&#8221;center&#8221; admin_label=&#8221;Data Ark Policy download button&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; custom_button=&#8221;on&#8221; button_text_color=&#8221;#FFFFFF&#8221; button_bg_use_color_gradient=&#8221;on&#8221; button_bg_color_gradient_start=&#8221;#00aeef&#8221; button_bg_color_gradient_end=&#8221;#221f72&#8243; button_bg_color_gradient_direction=&#8221;255deg&#8221; button_border_radius=&#8221;26px&#8221; button_font=&#8221;|600||on|||||&#8221; button_use_icon=&#8221;off&#8221; custom_margin=&#8221;20px||20px||false|false&#8221; custom_padding=&#8221;15px|30px|15px|30px|false|false&#8221; button_text__hover_enabled=&#8221;off|desktop&#8221;][\/et_pb_button][et_pb_text admin_label=&#8221;Pricing\/Contact Text&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243; header_2_text_color=&#8221;#00aeef&#8221; header_2_font_size=&#8221;24px&#8221;]&nbsp;<\/p>\n<h1>Contact Data Ark Team<\/h1>\n<p>The Data Ark team manages the data, data access, and data updates.\u00a0<strong>F<\/strong><strong>or all inquiries related to the Data Ark, especially to access or utilize data, please email:\u00a0<a href=\"mailto:hpchelp@hpc.mssm.edu\">hpchelp@hpc.mssm.edu<\/a><\/strong><\/p>\n<p>&nbsp;<\/p>\n<h1>Data Ark Slack Channel<\/h1>\n<p>Join our<strong>\u00a0Data Ark Slack channel at\u00a0<\/strong><a href=\"https:\/\/join.slack.com\/t\/data-ark\/signup\">https:\/\/join.slack.com\/t\/data-ark\/signup\u00a0<\/a>and sign up using your Mount Sinai credentials. You will be able to interact with the researchers right away!<\/p>\n<p>&nbsp;<\/p>\n<h1>Acknowledge CTSA<\/h1>\n<p>Please acknowledge CTSA a fund source for Data Ark in your ensuing publications as the following.<\/p>\n<p><strong>&#8220;This work was supported in part through the computational resources and staff expertise provided by Scientific Computing and Data at the Icahn School of Medicine at Mount Sinai and supported by the Clinical and Translational Science Awards (CTSA) grant UL1TR004419 from the National Center for Advancing Translational Sciences.&#8221;<\/strong>[\/et_pb_text][\/et_pb_column_inner][\/et_pb_row_inner][et_pb_row_inner _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_column_inner saved_specialty_column_type=&#8221;3_4&#8243; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][et_pb_divider _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221;][\/et_pb_divider][et_pb_text admin_label=&#8221;Quick Links&#8221; _builder_version=&#8221;4.9.0&#8243; _module_preset=&#8221;default&#8221; header_font=&#8221;|700|||||||&#8221; header_text_color=&#8221;#221f72&#8243; header_2_text_color=&#8221;#221f72&#8243;]<\/p>\n<h1>Data Ark Quick Links<\/h1>\n<ul>\n<li><a href=\"https:\/\/labs.icahn.mssm.edu\/minervalab\/resources\/data-ark\/data-ark-data-sets\/\">Data Ark Data Sets<\/a><\/li>\n<li><a href=\"https:\/\/dataarkforms.hpc.mssm.edu\/\">Data Use Agreement<\/a><\/li>\n<li><a href=\"https:\/\/redcap.mountsinai.org\/redcap\/surveys\/?s=LLTARNCPP7HYYT9X\">Suggest a New Data Set<\/a><\/li>\n<li><a href=\"mailto:data-ark-team@lists.mssm.edu\">Contact Data Ark Team<\/a><\/li>\n<\/ul>\n<p>[\/et_pb_text][\/et_pb_column_inner][\/et_pb_row_inner][\/et_pb_column][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scientific Computing and Data \/ Research Data Services \/ Data Ark: Data Commons \/ About Data ArkAbout Data Ark The Data Ark team downloads, organizes and performs quality assurance and quality control on the data. The team also manages the data access process, answers questions on the data, and updates to the latest versions of [&hellip;]<\/p>\n","protected":false},"author":600,"featured_media":0,"parent":1321,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"class_list":["post-3699","page","type-page","status-publish","hentry"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/3699","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/users\/600"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/comments?post=3699"}],"version-history":[{"count":68,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/3699\/revisions"}],"predecessor-version":[{"id":9947,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/3699\/revisions\/9947"}],"up":[{"embeddable":true,"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/pages\/1321"}],"wp:attachment":[{"href":"https:\/\/labs.icahn.mssm.edu\/minervalab\/wp-json\/wp\/v2\/media?parent=3699"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}