rsem-1.3.3 with bowtie-2.5.4
Start an interactive job, with a walltime of 2 hours, 2000MB of memory.
srun --pty -p interactive -t 0-02:0:0 --mem 2000MB -n 1 /bin/bashCreate a working directory on scratch and change into the newly-created directory. For example, for user abc123, the working directory will be
mkdir /n/scratch/users/a/abc123/rsem
cd /n/scratch/users/a/abc123/rsemClone rcbio and setup path, then copy some test data. For details to use your own data, please visit this page: Build Folder Structures From Sample Sheet for rcbio NGS Workflows
# This will setup the path and environment variables for the pipeline
git clone https://github.com/ld32/rcbio.git $HOME/rcbio
export PATH=$HOME/rcbio/bin:$PATH
cp /n/shared_db/misc/rcbio/data/fruitFlyFastq/sampleSheet.xlsx .
buildSampleFolders.sh sampleSheet.xlsx
Copy the example rsemBowtie2 bash script:
cp $HOME/rcbio/bin/rsemBowtie2.sh .
Now you can modify the options as needed. Please reference the http://deweylab.github.io/RSEM/ if you have any questions.
To edit the Bowtie2 and RSEM bash script:
nano rsemBowtie2.shTo test the pipeline run the following command. Jobs will not be submitted to the scheduler.
runAsPipeline "rsemBowtie2.sh -r mm10" "sbatch -p short --mem 6G -t 2:0:0 -n 1" useTmp
# this is a test runTo run the pipeline:
runAsPipeline "rsemBowtie2.sh -r mm10" "sbatch -p short --mem 6G -t 2:0:0 -n 1" useTmp run 2>&1 | tee output.log
# notice here 'run 2>&1 | tee output.log' is added to the commandTo understand how 'runAsPipeline' works, how to check output, how to re-run the pipeline, please visit: Run Bash Script As Slurm Pipeline
Now you are ready to run an rcbio workflow
To instead run the workflow on your own data, transfer the sample sheet to your local machine following this wiki page and modify the sample sheet. Then you can transfer it back to O2 under your account, then go to the build folder structure step.