Wiki source code of Getting Started with HDC
Last modified by Dennis Segebarth on 2025/02/24 16:29
Show last authors
author | version | line-number | content |
---|---|---|---|
1 | {{box cssClass="floatinginfobox" title="Table of Contents"}} | ||
2 | {{toc/}} | ||
3 | {{/box}} | ||
4 | |||
5 | = Accessing HealthDataCloud (HDC) = | ||
6 | |||
7 | == Creating a User Account == | ||
8 | |||
9 | All active EBRAINS users can use their EBRAINS credentials to create a user account in the HealthDataCloud central node platform. If you don’t already have a valid EBRAINS account, please visit the [[EBRAINS registration page>>url:https://ebrains.eu/register/]] to obtain one. | ||
10 | |||
11 | To create an HDC user account: | ||
12 | |||
13 | * Visit the HDC Portal at [[https:~~/~~/hdc.humanbrainproject.eu/>>url:https://hdc.humanbrainproject.eu/]] and click the **Login** button. | ||
14 | * ((( | ||
15 | Click the **EBRAINS **button and enter your EBRAINS username and password. | ||
16 | ))) | ||
17 | * ((( | ||
18 | Review and familiarize yourself with the **Terms of Use**, then click **Accept **to finalize your registration. | ||
19 | |||
20 | * Note, the **Accept **button isn’t activated until you scroll down to the end of the document. You may export the Terms of Use as a PDF file or review them at any time from both the [[HDC website>>url:https://hdc.humanbrainproject.eu]] or at the bottom left corner (“Terms of Use”) of each page after logging into the HDC. | ||
21 | ))) | ||
22 | |||
23 | Your personal information is collected to provide a user account and access to HDC services in accordance with the [[HDC Privacy Policy>>url:https://object.hdc.humanbrainproject.eu/public-resources/HDC-Privacy-Policy.pdf]]. | ||
24 | |||
25 | == Discovering the HDC - a Quick Start Guide == | ||
26 | |||
27 | With the release of Pilot-HDC v1.3, you can now discover all features of the HealthDataCloud central node platform on a self-serve basis. Here, we prepared a quick start guide that you can follow along, or explore the platform on your own - it´s now easier than ever! | ||
28 | |||
29 | (% class="box warningmessage" %) | ||
30 | ((( | ||
31 | Important: Please note that the HDC central node platform has not yet undergone a GDPR audit and you are, therefore, not allowed to upload any sensitive data to the platform. | ||
32 | ))) | ||
33 | |||
34 | === Join the HealthDataCloud Test Project === | ||
35 | |||
36 | Before being able to test the platforms features, you have to join one of its Projects. A **Project** is an isolated, access-controlled data management unit in HDC that holds all of the data and computational resources relating to a research project. Projects are owned by a **Project Administrator**,** **a role analogous to an academic Principal Investigator. Usually, joining a Project requires the corresponding Project Admin to invite you to join. However, for testing and discovering of the HDC features, we created the **HealthDataCloud Test Project**, which you can join yourself at any moment by the click of a button. To do so: | ||
37 | |||
38 | 1. Go to the **Projects** page using the the top-level navigation bar: | ||
39 | \\[[image:Projects_navigation_bar.png||height="80" width="839"]] | ||
40 | |||
41 | 1. Click on the **Join Starting Project** button: | ||
42 | [[image:Join_starting_project.png||height="291" width="827"]] | ||
43 | |||
44 | 1. A pop-out window will open and confirm that you successfully joined the HealthDataCloud Test Project with the role of a Project Collaborator (see [[this section>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Managing%20HDC%20Projects/#HProjectRoles]] of our User Guide for more information about the different roles and associated permissions) and remind you again to **not upload any sensitive information or data **to this Project: | ||
45 | \\[[image:Screenshot 2024-11-25 104224.png||height="294" width="458"]] | ||
46 | |||
47 | You can now proceed to the landing page of the HealthDataCloud Test Project, either by clicking on the **Go to Project** button in the pop-out, or via the **Projects** page using the navigation bar, where the HealthDataCloud Test Project will now be listed: | ||
48 | |||
49 | [[image:HDC_test_project_on_Projects_page.png||height="283" width="869"]] | ||
50 | |||
51 | |||
52 | The landing page of the HealthDataCloud Test Project will look like this (please refer to [[this section>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/HDC%20Portal%20Navigation/#HProjectsInterface]] of our User Guide article to learn more about the Projects Interface): | ||
53 | |||
54 | [[image:Landing_page_HDC_test_project.png||height="430" width="872"]] | ||
55 | |||
56 | |||
57 | (% class="box infomessage" %) | ||
58 | ((( | ||
59 | Access permissions within the HDC are tied to your EBRAINS account and setting of new permissions may sometimes take a while to be updated. This can sometimes cause an "Access denied" error, when you try to access the HealthDataCloud Test Project after completing the steps above. To enforce an update of your permissions in your EBRAINS Account, simply logout and log back in to the HDC central node platform. The associated permissions of your EBRAINS account are now refreshed and you will be able to access the HealthDataCloud Test Project. | ||
60 | ))) | ||
61 | |||
62 | === Learn about Zone Restrictions === | ||
63 | |||
64 | As a Research Data Management Platform, the central features of the HDC evolve around data flows, and how compliance with data privacy regulations can be ensured by default and by design. In this quick start guide, we´ll help you gain a general understanding of the Platforms design and access restrictions that apply in it´s different zones. Once you complete this guide, however, we highly recommend you have a look at our in-depth articles about the respective topics (for instance, see [[Working with Project Files in the Portal>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Working%20with%20HDC%20Project%20Files%20in%20the%20Portal/]]), which introduce many additional features we won´t touch upon here. | ||
65 | |||
66 | Any data that is uploaded to the platform first lands in the //Green Room// zone of the respective Project. The //Green Room// is a dedicated and highly isolated storage area within each Project, where only the Project Administrator(s) and the user who uploaded the data have access to it. This allows the Project Administrator(s) to verify that the data are compliant with the respective Project’s data management plan before being copied to the //Core //zone of the Project. This is important, as data in the //Core// automatically becomes accessible to all members who have at least the Project Collaborator role (see [[this section>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Managing%20HDC%20Projects/#HProjectRoles]] of our User Guide for more information about the different roles and associated permissions). Data that was transferred into the //Core// zone of a Project can then also be organized into Datasets. In brief, Datasets are collections of related Project files, folders, and detailed metadata. The Dataset feature allows users to organize Project files, enrich with annotations, validate data structures, and publish controlled versions for sharing (see [[this article>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Working%20with%20HDC%20Datasets/]] of our User Guide for more information about Datasets). | ||
67 | |||
68 | The HDC also offers Workspace Tools, i.e. direct integration of computational tools like JupyterHub that allow you to view, process, analyze, or annotate and describe your data. In general, Workspace Tools can be deployed in both //Green Room //and //Core// zones, depending on the Projects needs. In the HealthDataCloud Test Project, however, all Workspace Tools (the virtual programming environment JupyterHub, the business intelligence tool Superset, the Virtual Machine interface Guacamole, and the documentation tool XWiki) are deployed in the //Core// zone. | ||
69 | |||
70 | === Upload Data === | ||
71 | |||
72 | Now that you have a high-level understanding of the different zones within the HDC, let´s start by uploading some test data to the platform via the it´s web interface, i.e. the Portal. Please be reminded again, however, to abide by the Platform Terms of Use and any Project-specific restrictions when using the Portal, and - specifcally - that **you are not allowed to upload any sensitive data to the HealthDataCloud Test Project**. To upload data: | ||
73 | |||
74 | 1. Go to the **File Explorer// //**of the HealthDataCloud Test Project: | ||
75 | \\[[image:File_explorer.png||height="410" width="832"]] | ||
76 | |||
77 | 1. You automatically land in your home directory within the //Green Room// zone, which is also indicated in the UI. Feel free to create a new folder and, once you selected the destination folder to which you´d like to upload, please click on **Upload:** | ||
78 | \\[[image:Upload.png||height="318" width="827"]] | ||
79 | |||
80 | 1. A pop-out window will open and prompt you to specify the file(s) or folder(s) you´d like to upload. Once you selected data to upload, click on **Upload** to start the upload, or click on **Cancel** to abort. Once more: **Please remember that you are not allowed to upload sensitive data to the HealthDataCloud Test Project.** | ||
81 | \\[[image:1740414584737-871.png||height="218" width="381"]] | ||
82 | |||
83 | 1. You can check the progress of the upload by clicking on the **File Status Icon**. Please refer to [[this article>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Working%20with%20HDC%20Project%20Files%20in%20the%20Portal/]] in our User Guide for many more features that are available when uploading data to the HDC, including the [[//Resumable Upload//>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Working%20with%20HDC%20Project%20Files%20in%20the%20Portal/#HResumingFailedUploads]] feature that allows you to continue previous uploads that got interrupted, e.g. due to unstable network connections. | ||
84 | |||
85 | Congratulations - you now know about the most important and basic functionality of the HDC. As a next step, you´d usually want to copy the newly uploaded data from the //Green Room// to the //Core// of the Project, in order to work on them collaboratively with your team. If you are not a Project Administrator, though, you do not have the permissions to directly copy data between zones. Instead, you will have to open a //Copy to Core Request//, which needs to be approved by a Project Administrator - giving them the chance to validate that the data complies with the Projects Data Management plan (see [[this section>>https://xwiki.hdc.humanbrainproject.eu/bin/view/userguide/Working%20with%20HDC%20Project%20Files%20in%20the%20Portal/#HRequestingFileCopyfromtheGreenRoomtotheCore]] in our User Guide for a more detailed description of this process). In our Quick Start Guide, we´ll instead continue with working on some data that already resides in the //Core// of the Project and can be accessed collaboratively. | ||
86 | |||
87 | === Working with Data in the //Core// === | ||
88 | |||
89 | As outlined in above section //Learn about Zone Restrictions//, data in the //Core// zone of a Project is shared with all Project members that have at least the //Project Collaborator// role to foster collaboration. You can browse all accessible data by using again the **File Explorer**, click on **Core**, and then navigate to the folder of interest. In this Quick Start Guide, we will now introduce how you can leverage Workspace Tools like JupyterHub to process, analyze, and visualize your data. For this, | ||
90 | |||
91 | 1. Click on the **JupyterHub **icon to launch your own server within the HealthDataCloud Test Project: | ||
92 | \\[[image:JHub.png||height="399" width="827"]] | ||
93 | |||
94 | 1. If prompted, select to "Sign in with Keycloak" which allows you to continue using single sign-on (SSO). Next, make sure to select the //Datascience Environment// which will launch your instance of JupyterHub with all the required packages preinstalled. After clicking on start, you can see the status of your server launch indicated by a progress bar: | ||
95 | \\[[image:DataScience.png||height="197" width="836"]] | ||
96 | |||
97 | 1. Once your server has launched, you can use JupyterHub as you´re used to in order to create Jupyter Notebooks or write scripts. For this Quick Start Guide, we´ll continue to use the **Terminal** in order to leverage the platform´s command line interface (CLI), //pilotcli// to move data between your Project and the Workspace Tools: | ||
98 | \\[[image:StartTerminal.png||height="584" width="831"]] | ||
99 | |||
100 | 1. Now you´ll authenticate your session with the //pilotcli// by executing the command {{code language="none"}}pilotcli user login{{/code}} to initiate the login process. This will output a link: | ||
101 | \\[[image:pilotcli_login.png||height="428" width="827"]] | ||
102 | |||
103 | 1. Please use a new browser tab or window to access the link from step 4. Then grant the requested permission to complete your login process: | ||
104 | \\[[image:grant_permissions.png||height="329" width="448"]] | ||
105 | |||
106 | 1. Back in JupyterHub, the Terminal will confirm your successful login, which allows you to perform the final step of this section, which is to copy a demo Juypter Notebook that guides you through the entire remaining process by executing the following command in the JupyterHub Terminal: {{code language="none"}}pilotcli file sync hdctestproject/dsegebarth/tabula_muris_facs/tabula_muris_demo.ipynb .{{/code}} | ||
107 | \\[[image:download_demo_ipynb.png||height="440" width="830"]] | ||
108 | |||
109 | 1. This will download a demo Jupyter Notebook with rich annotations and explanations, guiding you through the process of how a typical analysis workflow within HDC using Jupyter Notebooks could look like. Please feel free to follow along and execute the cells of the notebook, which will download the remaining data as needed. | ||
110 | \\[[image:Demo_NB.png||height="312" width="830"]] | ||
111 | |||
112 | There are obviously many more features that the HDC has to offer, which are explained in great detail in other sections of this User Guide. We hope this Quick Start Guide allowed you to gain a good understanding of the core features and functionalities of the HDC and how they can support your collaborative day to day work. For more information on how to request the creation of your own HDC Project that allows you to on-board your team and specify the computational resources needed, please have a look at the following sections. | ||
113 | |||
114 | == Obtaining your own HDC Project == | ||
115 | |||
116 | A **Project** is an isolated, access-controlled data management unit in HDC that holds all of the data and computational resources relating to a research project. Projects are owned by a **Project Administrator**,** **a role analogous to an academic Principal Investigator. If you’re interested in starting up your own HDC Project, please contact the HDC team (see //Contact Us//). Access to HDC is governed by the [[HDC Access Policy>>url:https://object.hdc.humanbrainproject.eu/public-resources/HDC-Access-Policy.pdf]]. | ||
117 | |||
118 | == Joining a Project == | ||
119 | |||
120 | If a Project Administrator has invited you to join an HDC Project and you already have an HDC user account, you’ll receive an email informing you of your new Project membership and role. You’ll be able to access the Project the next time you log in. | ||
121 | |||
122 | If you received an invitation to join an HDC Project but don’t already have a user account, you can complete your account registration following the instructions in the invitation email (see //Creating a User Account//) and then log in to the Portal and access your Project. | ||
123 | |||
124 | If you’re a Project Administrator, the Platform Administrator will invite you to your new Project. Consult the article //Managing HDC Projects// for more information. | ||
125 | |||
126 | == Changing your Password or Forgot Password == | ||
127 | |||
128 | Your HDC password is your EBRAINS password. | ||
129 | |||
130 | * To change your EBRAINS password, sign into [[EBRAINS>>url:https://www.ebrains.eu]] and click Manage Account in the upper right corner. | ||
131 | * If you forgot your EBRAINS password, click Forgot Password from the EBRAINS Login window and follow the prompts to create a new password. | ||
132 | |||
133 | == Logging into the HDC Portal == | ||
134 | |||
135 | * ((( | ||
136 | Open the [[HDC Portal>>url:https://hdc.humanbrainproject.eu/]] in a supported web browser. | ||
137 | |||
138 | * Popular browsers like Google Chrome, Mozilla Firefox, Microsoft Edge, and Apple Safari are supported. Some older browser versions as well as other browsers like Internet Explorer may not be supported. | ||
139 | ))) | ||
140 | * Click the **Login **button | ||
141 | * ((( | ||
142 | Click the **EBRAINS **button and enter your EBRAINS username and password. | ||
143 | |||
144 | * If you don’t already have a valid EBRAINS account, please visit the [[EBRAINS registration page>>url:https://ebrains.eu/register/]] to obtain one, then return to the [[HDC Portal>>url:https://hdc.humanbrainproject.eu]] and log in with your EBRAINS username and password. | ||
145 | ))) | ||
146 | |||
147 | == Viewing your User Profile == | ||
148 | |||
149 | Click your username in the top right corner of any page and click **Account **to open your account profile page. The account page displays your user profile information, Project membership, and recent activities relating to your account. | ||
150 | |||
151 | == Logging out of the HDC Portal == | ||
152 | |||
153 | It’s recommended to always log out of the Portal whenever you finish using the platform or step away from your workstation. | ||
154 | |||
155 | * To log out of your session manually, click your username in the top right of the page and select **Logout**. | ||
156 | * Idle sessions are logged out for added security. After a period of inactivity, a warning reminds you that your session is about to expire. If no further action is performed and no active processes are running, your session will be logged out automatically. | ||
157 | |||
158 | = How to get help = | ||
159 | |||
160 | == Consult the User Guide == | ||
161 | |||
162 | This guide provides information on how to use the features of the HDC platform. | ||
163 | |||
164 | == Submit a Support Inquiry == | ||
165 | |||
166 | * Users logged into the HDC portal can view the Support panel by clicking **Support **from the top right of any page. Here, you will find Frequently Asked Questions (FAQs) and, after scrolling to the bottom of the support panel, a form to submit a Support inquiry. | ||
167 | * Anyone can submit an inquiry to the [[EBRAINS support team>>url:https://www.ebrains.eu/contact]]. | ||
168 | |||
169 | == Watch for Upcoming Maintenance updates == | ||
170 | |||
171 | The support team announces planned maintenance with notifications displayed after logging into the portal. If you see an Upcoming Maintenance notice, take note of future downtime periods and plan your platform use to minimize the impact on your work. | ||
172 | |||
173 | == Contact us == | ||
174 | |||
175 | * Log in and submit a Support inquiry from the Portal Support section. | ||
176 | * Submit a support inquiry to the [[EBRAINS Support Team>>url:https://www.ebrains.eu/contact]]. | ||
177 | |||
178 | = HDC Portal organization = | ||
179 | |||
180 | The Portal consists of the following areas: | ||
181 | |||
182 | * Main Menu (common to all pages) | ||
183 | * Dashboard | ||
184 | * Projects Landing Page | ||
185 | * Projects Interface | ||
186 | * Datasets | ||
187 | * Platform Management (for Platform Administrators only) | ||
188 | |||
189 | Detailed information on these components is offered in dedicated sections of the User Guide. | ||
190 | |||
191 | HDC also offers a full-featured Command Line Interface, pilotcli, a binary executable program that provides advanced users with convenient tools for performing file actions and platform-related tasks programmatically. For more information on the Command Line Interface, see //Working with HDC Project Files in the Command Line Interface//. | ||
192 | |||
193 | |||
194 | ---- | ||
195 | |||
196 | Copyright © 2023-2024 [[Indoc Systems>>url:https://www.indocsystems.com]]. | ||
197 | |||
198 | HealthDataCloud is powered by Pilot technology, a product of [[Indoc Systems>>url:https://www.indocsystems.com]]. |