Build Folder Structures From Sample Sheet for rcbio NGS Workflows

 


Note: You can copy and paste all the text to your Linux command line to run. Anything with "#" is comment, and will be IGNORED by Linux.

Login to O2

# replace user123 with your eCommons ID ssh user123@o2.hms.harvard.edu



Start an interactive session

# this command requests a job in the interactive partition, with one processor for 2 hours srun --pty -p interactive -t 0-02:0:0 --mem 2000MB -n 1 /bin/bash



Make a folder to work in

# Make a directory in scratch file system and work there. We recommend creating separate folders for each project. mkdir -p /n/scratch/users/${USER:0:1}/$USER/test && cd /n/scratch/users/${USER:0:1}/$USER/test



Note: Each user has 25 TiB /n/scratch space. There is no backup for data saved in/n/scratch, and files will be deleted if they are not accessed for a month. You can read more about /n/scratch on the Filesystems page.

Copy testing sample sheet to work directory

Examine sample sheet on local computer

The sample sheet is in Microsoft Excel format. You can look at this file by transferring it to your local computer, and opening in Excel. Programs that can be used to transfer the sample sheet include Filezilla or WinSCP. For help on transferring files to or from the O2 cluster, please read the File Transfer wiki page. 

You can also use our new O2Portal to do file transfer: https://o2portal.rc.hms.harvard.edu/pun/sys/dashboard



Build folder structure from the sample sheet





Look at the folder structure



Now you are ready to run an rcbio workflow

To instead run workflow on your own data, transfer the sample sheet to your local machine following this wiki page and modify the sample sheet. Then you can transfer it back to O2 under your account, then go to the build folder structure step.