Differential Abundance

This nf-core pipeline does differential abundance analysis for feature/ observation matrices from platforms such as RNA-seq.

Please follow the instructions below to use our analysis pipeline and run it on your RNASeq data:

Prerequisites:

CCBB assumes that you have

The outcome for your RNAseq data from CCBB bulk RNAseq analysis pipeline is required for running the CCBB Differential Abundance analysis pipeline.

Please follow the instructions below to get access to our analysis pipeline.

Step1:

Preparing input files on Garibaldi for running CCBB differentialabundance pipeline

  • Please make sure you have the following files generated by the CCBB bulk RNAseq pipeline, prior to running the differentialabundance pipeline:
    Files “salmon.merged.gene_counts_length_scaled.tsv” and “salmon.merged.gene_lengths.tsv” are available at the location /full-path-to-bulk-RNAseq-folder-on-Garibaldi/results/salmon_star/
  • In the same folder, generate “samplesheet_de.csv” and “contrasts.csv” files based on the examples provided.
    Edit them to reflect your sample name, /full-path-to-bulk-RNAseq-folder-on-Garibaldi data files (fastqs), group to which the sample belongs (a single word that describes the group), replicate number of the sample (1, 2, 3 …)

The RNA-seq and differentialabundance workflows expect the following folder/file structure:

CCBB_DIFFERENTIAL_ABUNDANCE_PIPELINE_FOLDER_STRUCTURE

Step2:

Now you are ready to run the CCBB differentialabundance pipeline!

Follow the instructions below to begin:

Go to the Workflow site on your browser: (Refer to the image at the end of this document for details)

  • Click on “Experiments”
  • To select a workflow click on “My WorkFlows” (on the top right corner), then click on “New Experiment”
  • Click on “CCBB_DIFFERENTIALABUNDANCE_WF”
  • Choose “Garibaldi” from the dropdown menu for “Execution profile:” (top of the page)

The workflow “CCBB_DIFFERENTIALABUNDANCE_WF” needs three inputs as defined in below.

  • Click on the “Input Data” tab. In the text box,
    • (a) type in the full path to your folder on Garibaldi where you performed bulk RNA-seq analysis using our workflow,
    • (b) type in the full path to your folder on Garibaldi where you generated the files for your dataset using instructions in Step1: “samplesheet_de.csv” and “contrast.csv”.
    • (c) type in the full path to where you want the results of the differentialabundance analysis to be generated on Garibaldi , then click the run button CCBB BULKRNASEQ PIPELINE WF_RUN to run “CCBB_DIFFERENTIALABUNDANCE_WF”.

Upon completion, check to see the sub-folder “DE_results” on Garibaldi at the location you specified. (example: /gpfs/home/johndoe/…/differential_abundance_analysis/DE_results)

Follow the details to understand the pipeline outcome for your data.

Step3:

Post analysis follow-up for users. Please follow the instructions that are sent via email upon pipeline completion:

  • Contact CCBB for any questions regarding the pipeline, results of analysis, further assistance with downstream analysis (like pathway analysis, custom differential expression analysis, specific plots for publication etc.).
    Please take a few minutes to give us your feedback.

NOTE

If you need custom work to be done (e.g. specific model matrix to be used for a comparison to account for known batch effects) request for a meeting with CCBB. The service hours will be determined by CCBB.

DISCLAIMER

CCBB is providing pipelines on Garibaldi, the institute’s shared linux cluster, on “as is” and “as available” terms, solely for the benefit of the scientific community at Scripps Research. CCBB reserves the right to continue/discontinue any of its pipelines on Garibaldi.

CCBB DIFFERENTIAL_ABUNDANCE PIPELINE WF