Analyzing Data ******** During the :bdg-info:`Planning Research` stage, a :bdg-primary:`Project folder` for storing research data is not yet available. Therefore, when dealing with any data (real or simulated) during the :bdg-info:`Planning Research` stage, you will most often want to store it in your :bdg-primary:`Home Drive`. This is because, since the :bdg-primary:`Home Drive` is mounted on :bdg-primary:`High Performance Storage`, you will be able to work with the :bdg-primary:`HPC Cluster` which can speed up your analysis time. Otherwise, you would use the :bdg-primary:`local storage` on your DCCN-issued PC. .. Note:: It is essential that data generated in the :bdg-info:`Planning Research` stage that is eventually relevant for the project is moved to the :bdg-primary:`Project folder` once this is created after PPM approval. Let's see how we would download existing data. Below, we will show three examples of downloading existing data from the :bdg-primary:`RDR` or an :bdg-primary:`External Repository`. Closed Access Collection with Repocli ========== .. _Radboud Data Repository: https://data.ru.nl/collections/di/dccn/DSC_3010000.05_519 .. _read how to do that here: https://intranet.donders.ru.nl/index.php?id=vnc00&no_cache=1&sword_list%5B%5D=tigerVNC Once you have been added to a collection, you will receive an email to the email account you used when you signed up to the Radboud Data Repository stating that you have been added. Workshop attendees will be added to our workshop's :bdg-primary:`DAC` in the :bdg-primary:`RDR` which is :bdg-warning:`Closed Access` Others can access our workshop's :bdg-warning:`Open Access` :bdg-primary:`DSC` in the `Radboud Data Repository`_ to do this exercise. We will practice downloading this DSC onto your Home Drive (replace groupname with your lab group's name and firlas with your DCCN username) 1. Establish a Network Connection to Trigon (either eduVPN or hardwired) 2. Log in on the Radboud Data Repostory * Go to https://data.ru.nl/ and click ``LOG IN`` .. figure:: RDR_login.png :figwidth: 50% :align: center * Click ``Login with your SURFconext account`` .. figure:: RDR_login2.png :figwidth: 50% :align: center * Login with the USEZ credentials of the account that has been added to the our workshop's :bdg-primary:`DAC` * After loggin in, click on the dropdown with your name, then click ``Data access credentials`` .. figure:: RDR_access.png :figwidth: 50% :align: center * Click the button to copy the password and leave this page open while you do other excercises 3. Open a TigerVNC session (`read how to do that here`_) 4. Login to the :bdg-primary:`Radboud Data Repository` * Open TigerVNC * Open the terminal application * Type ``repocli shell`` and then push ``enter`` * Type ``config`` and then push ``enter`` * Enter your RU username (u1234567@ru.nl) and then push ``enter`` * Enter the RDR password you retreived in step 2, then push ``enter`` 5. Download the Data Sharing Collection to Your Home Directory * Type ``get dccn/DAC_3010000.05_873 /home/groupname/firlas`` for workshop attendees * Type ``get dccn/DSC_3010000.05_519 /home/groupname/firlas`` for non-workshop attendees Public Collection with Cyberduck ========== 1. Establish a Network Connection to Trigon (either eduVPN or hardwired) 2. Login to the :bdg-primary:`Radboud Data Repository` * Open Cyberduck * Click ``Open Connection`` at the top left of the window * Click on the dropdown menu (which defaults to ``File Transfer Protocol (FTP)``) and select ``WebDAV (HTTPS)`` * At the ``Server:`` field type in ``data.ru.nl`` * Click the box next to ``Anonymous Login`` and then click ``Connect`` at the bottom right side of the window 3. Download the Data Sharing Collection to Your Folder of Choice * Double click on the ``dccn`` directory * Single click on the ``DSC_3010000.11_518_v1`` directory * Locate and click on ``Action`` at the top of the window and then select ``Download To`` on the dropdown menu * Select the folder you want to download the data to and then push ``Ok`` * Select the disconnect button at the top right of the window Other Repositories with Cyberduck ========== .. _Demo: https://openneuro.org/datasets/ds004564/versions/1.0.1 Let's say you have found a data collection you want to analyze on OpenNeuro called `Demo`_. Before you can go about downloading it with :bdg-dark:`Cyberduck` or :bdg-dark:`Repocli`, you must determine which protocol you must use to interact with the server. Below is a list of the Protocols required by some commonly used Open Access Repositories. .. table:: :widths: auto +------------+----------+-----------+--------+ | Repository | OpenfMRI | OpenNeuro | OMEGA | +============+==========+===========+========+ | Protocol | AWS | AWS | WebDAV | +------------+----------+-----------+--------+ 1. Establish an AWS Connection to the OpenNeuro folder you want to download * Open Cyberduck and push ``Open Connection`` * Click on the dropdown menu which defaults to ``FTP (File Transfer Protocol)`` and select ``Amazon S3`` * Click on the dropdown for more options and copy the domain name and the dataset number, seperated by a forward slash (i.e. ``openneuro.org/ds004564``) into the ``Path:`` field * In the ``Access Key ID:`` field, type ``anonymous`` .. figure:: Cyberduck_AWS.png :figwidth: 50% :align: center * Click ``connect`` 2. Download the fMRI data * Control-click the data you want to download (so the subject data folders and the ``participants.tsv`` file) * Click ``Action`` and then ``Download To`` in the dropdown * Select the directory you want to download the data to and then push ``OK``