Pipeline install bundle (docker-free)
An experimental .tar.gz bundle to install the Rhapsody Sequence Analysis pipeline, without the use of a docker image.
This contains cwl-runner and all the major required dependencies. Simply download, extract, and run.
Linux only. Currently tested Linux versions:
- Ubuntu 16.04 / 20.04 / 22.04
- Red Hat 7
- CentOS 7 / 9
Instructions
Extract the tar.gz bundle and enter that folder:
tar -xvzf rhapsodyPipeline-[version].tar.gz
cd rhapsodyPipeline-[version]
To run the BD Rhapsody Sequence Analysis Pipeline:
First, define the input files and pipeline parameters in a pipeline_inputs.yml file. See the included 'pipeline_inputs_template.yml' for instructions on formatting the YML file. Then, start the pipeline with this command:
./rhapsody pipeline --outdir results_dir pipeline_inputs.yml
To run a small test of the BD Rhapsody Sequence Analysis Pipeline with built-in demo data:
./rhapsody pipeline --outdir test_results test_files/test_smallDemo.yml
By default, the pipeline allows parallel node execution. To turn this off, use the --no-parallel option:
./rhapsody pipeline --no-parallel pipeline_inputs.yml
Any cwltool option can be added to the pipeline command. The YML file should always be the last argument. Examples are --outdir, --tmpdir-prefix, --leave-tmpdir.
./rhapsody pipeline --leave-tmpdir --outdir /path/to/directory pipeline_inputs.yml
Extra Utilities:
See additional instructions: Extra Utilities
Given pairs of R1/R2 FASTQ files from Rhapsody libraries, only annotate the cell label and UMI of R1 and put it in the header of R2:
./rhapsody annotateCellLabelUmi inputs.yml
To create a new Reference Archive for use as an input to the Rhapsody Sequence Analysis Pipeline:
./rhapsody makeRhapReference inputs.yml
Example inputs YML:
Genome_fasta:
- class: File
location: "./GRCh38.primary_assembly.genome.fa.gz"
Gtf:
- class: File
location: "./gencode.v45.primary_assembly.annotation.gtf.gz"
Extra_sequences:
- class: File
location: "./gfp.fasta"
WTA_Only: True
To determine the amount of phiX contamination for a fastq file:
./rhapsody phiXContamination inputs.yml
Feedback welcome!