Question

Aggregation of bulk RNA-seq

0

2.9 years ago

fifty_fifty ▴ 90

I have 10 FASTQ files from RNA sequencing and need to do differential expression analysis between two groups of samples. Each group has 5 patients. What would be the best pipeline for this? Do I align each FASTQ to the genome and then somehow combine the 5 resulting files for each group? I suppose it should be straightforward.

RNA-seq • 912 views

ADD COMMENT • link updated 2.9 years ago by Marco Pannone ▴ 810 • written 2.9 years ago by fifty_fifty ▴ 90

0

Basic question before you begin - do you have 10 files or 10 pairs of files? Make sure you have sequencing information for all your samples.

Look into a simple pseudo-alignment (e.g. salmon or kallisto) + DESeq2 or STAR/RSEM + DESeq2 pipeline to go from raw sequence to counts to DE analysis.
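The single-versus-paired question above can often be answered from the file names alone. Below is a minimal Python sketch, assuming mates are marked with `_R1` / `_R2` before the extension (a common convention, but not something stated in this thread); `group_fastqs` and `library_layout` are hypothetical helper names, and the file names in the usage lines are made up.

```python
import re
from collections import defaultdict

# Assumption: mates are named <sample>_R1[...].fastq[.gz] and <sample>_R2[...].fastq[.gz].
# Other conventions (e.g. _1 / _2) would need a different pattern.
MATE_PATTERN = re.compile(r"^(?P<sample>.+)_R(?P<mate>[12])(?:_\d+)?\.(?:fastq|fq)(?:\.gz)?$")
EXTENSION = re.compile(r"\.(?:fastq|fq)(?:\.gz)?$")


def group_fastqs(filenames):
    """Return a dict mapping each sample name to the sorted list of its FASTQ files."""
    samples = defaultdict(list)
    for name in filenames:
        match = MATE_PATTERN.match(name)
        # Files without an _R1/_R2 tag are keyed by their name minus the extension.
        sample = match.group("sample") if match else EXTENSION.sub("", name)
        samples[sample].append(name)
    return {sample: sorted(files) for sample, files in samples.items()}


def library_layout(filenames):
    """Classify a set of FASTQ files by how many files each sample has."""
    sizes = {len(files) for files in group_fastqs(filenames).values()}
    if sizes == {1}:
        return "single-end"
    if sizes == {2}:
        return "paired-end"
    return "mixed or incomplete"


# Made-up file names, for illustration only.
print(library_layout(["p01_R1.fastq.gz", "p01_R2.fastq.gz", "p02_R1.fastq.gz", "p02_R2.fastq.gz"]))
print(library_layout(["p01.fastq", "p02.fastq"]))
```

This is only a file-name heuristic. A "mixed or incomplete" result is the case worth chasing up before any alignment or quantification is run.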

ADD REPLY • link 2.9 years ago by Ram 45k

0

Always do some exploratory analysis (such as PCA) to assess whether the biological replicates within each sample group cluster well together before combining them in any way. Also, when you perform DE analysis downstream (with DESeq2, for example), always provide a count matrix that includes every single biological replicate. Based on the experiment design table (often referred to as colData), DESeq2 can carry out the DE analysis between the two sample groups and apply the appropriate statistics.
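To make the "one column per replicate" point concrete, here is a rough Python sketch of how per-sample counts could be laid out before handing them to DESeq2. It is illustrative only: `build_count_matrix` and `build_coldata` are hypothetical helper names, the gene and sample names and counts are made up, and filling missing genes with 0 is an assumption (with a shared annotation every sample should report the same genes anyway).

```python
def build_count_matrix(per_sample_counts):
    """Combine {sample: {gene: count}} into a genes x samples table.

    Returns (genes, samples, rows), where rows[i][j] is the raw count of
    genes[i] in samples[j]. Replicates are never summed or averaged.
    """
    samples = list(per_sample_counts)
    genes = sorted(set().union(*(counts.keys() for counts in per_sample_counts.values())))
    # Assumption: a gene absent from one sample's table is counted as 0.
    rows = [[per_sample_counts[s].get(g, 0) for s in samples] for g in genes]
    return genes, samples, rows


def build_coldata(samples, group_of):
    """One (sample, group) row per matrix column, in the same order as the columns."""
    return [(s, group_of[s]) for s in samples]


# Toy data: two made-up samples per group instead of five, to keep it short.
counts = {
    "ctrl_1": {"geneA": 10, "geneB": 0},
    "ctrl_2": {"geneA": 12, "geneB": 1},
    "case_1": {"geneA": 30, "geneB": 5},
    "case_2": {"geneA": 28, "geneB": 7},
}
groups = {"ctrl_1": "control", "ctrl_2": "control", "case_1": "case", "case_2": "case"}

genes, samples, rows = build_count_matrix(counts)
coldata = build_coldata(samples, groups)

print("\t".join(["gene"] + samples))
for gene, row in zip(genes, rows):
    print("\t".join([gene] + [str(value) for value in row]))
```

The important part is that the sample order in the colData table matches the column order of the matrix; the group comparison itself is then left to the DE tool rather than done by merging files beforehand.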

ADD REPLY • link 2.9 years ago by Marco Pannone ▴ 810

score 1 · Answer 1 · 2022-05-11

1

2.9 years ago

ATpoint 87k

You can follow existing workflows such as https://bioconductor.org/packages/release/workflows/vignettes/rnaseqGene/inst/doc/rnaseqGene.html

ADD COMMENT • link 2.9 years ago by ATpoint 87k