How to test the quality of RNA-seq gene expression data ?
2
2
Entering edit mode
10.2 years ago
jack ▴ 980

Hi,

I'm looking for the quality control test of RNA-seq gene expression data. I have just expression values, not the sequence. which kind of test I can think of about it to have feeling about the quality of data?

I found the similar thing for microarray gene expression data.http://res.illumina.com/documents/products/technotes/technote_gene_expression_data_quality_control.pdf

quantification sequencing RNA-Seq genome • 4.4k views
ADD COMMENT
2
Entering edit mode
10.2 years ago

If you have several biological replicates - build scatterplots among them and compute correlation. You can also consider analyzing gene expression value distribution for "abnormalities" (e.g. with Q-Q plot), in this case you need some sample that is made using relatively same techniques and use the same expression value calculation scheme (i.e. TopHat -> FPKM -> log10).

ADD COMMENT
1
Entering edit mode
10.2 years ago

A PCA or hierarchical clustering are pretty much your only options.

ADD COMMENT
1
Entering edit mode

Is a correlation plot generated between biological replicates would not be enough?

ADD REPLY
2
Entering edit mode

If you have a lot of samples, then the correlation plots end up getting rather large and difficult to interpret, that's when PCA plots and such become a bit simpler.

ADD REPLY
0
Entering edit mode

Interesting, but what PCA plot will tell me about the quality of data ? For PCA, my features will be the biological replicates?

ADD REPLY
2
Entering edit mode

It'll tell you about the underlying variance structure. If you have samples clustering in odd ways, then there are likely batch effects on sets of genes.

ADD REPLY

Login before adding your answer.

Traffic: 1437 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6