RNA-seq biological replicates
2
0
Entering edit mode
7.1 years ago
amy16 ▴ 40

I've got PE Illumina Hi-seq RNA-seq data. I trimmed the adapters and then aligned the reads to the reference genome. Now before proceeding towards transcript assembly and quantification, I would like to know how to screen down which of the three biological replicates should I take forward for further analysis?

RNA-Seq • 2.0k views
ADD COMMENT
1
Entering edit mode
7.1 years ago

Hey,

The question is somewhat strange because it implies that you decided to use replicates with the outlook that 1 or more of them would fail(?) I'm not sure that I would spent a couple of hundred pound sterling GBP or ~!0,000 Rupee if I was later going to decide to ditch 1 or more of the samples.

If you've used a Hi-Seq and the laboratory personnel is experienced, then I imagine that you can any of the replicates.

Procedures that most people do with replicates:

  • process them as separate samples and then, after normalization, check how they line up on PC1 vs. PC2 via principal components analysis
  • average counts over the replicates post normalisation (this was more common in cDNA microarray analysis)
  • concatenate the raw data FASTQ files together (zcat piped into gzip) and then process them as a single sample

You mention assembly, so, I would concatenate your samples together and then do the de novo transcriptome assembly on the concatenated sample. Whilst saying this, all transcriptome assemblers that I've used allow you to specify multiple samples at the command line and then it merges them together anyway.

If any of the samples 'failed', I doubt that you'd have the data in hand right now. You should be able to confirm the basic quality of the samples by contacting the lab that did the sequencing, or just check the reports that they sent.

Good luck, Kevin

ADD COMMENT
0
Entering edit mode

Please accept my colleague's answer too. We give the same answer.

ADD REPLY
2
Entering edit mode
7.1 years ago
GenoMax 147k

All three. If you have enough reads in all three then you could assemble them independently and then merge the data to create a more comprehensive representation of the transcriptome.

ADD COMMENT
1
Entering edit mode

Got there while I was writing the answer. You're the master at comments!

We more or less give the same advice/

ADD REPLY

Login before adding your answer.

Traffic: 2344 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6