Analyzing Data with the HPC Cluster

Now that we have a complete data set, we can analyze our data on the HPC Cluster. Let’s take look at how you can do that!

Practice Analyzing Data on the HPC Cluster

Now that we have all of the data, we will run an analysis script.

  1. Identify subjects between 1 and 10 whose data should be excluded from the analysis

  • Return to File Explorer (you should be in the /project/3010000.05/XXXXXXX.XX/raw/ directory)

  1. Begin an RStudio session

  • Click on Applications and select Terminal Emulator in the dropdown menu

  • Type rstudio and push enter

  • Push OK to take the default versions of R and RStudio

  • Push OK to take the default time and memory specifications for your job

  1. Edit your Analysis File

  • Push Control and o together once RStudio has loaded

  • Double-click project on the left panel

  • Type 3010000.05 and push enter

  • Double-click scripts

  • Double-click on the analysis.R file you uploaded

  • At the top of the script, change the outfile variable to /project/3010000.05/XXXXXXX.XX/results

  1. Select the entire document and push control and enter together to run the script

  2. Save the workspace

  • Navigate to the Environment tab at the bottom right once the script has finished running

  • Click the blue square in the Environment tab to save the workspace as, and save the workspace as RDM_Workshop_Analysis in the /project/3010000.05/XXXXXXX.XX/results folder

  1. Disconnect

To read more about how to run analyses on the HPC Cluster and to get more practice, visit these pages on the HPC wiki.