Question

How to do if no normal control for RNA

0

Entering edit mode

4.7 years ago

9521ljh ▴ 50

I have 10 patient Tumor RNA sequence.

i have somatic mutation (DNA) list. Then i want to survey whether somatic mutation effect to RNA expression.

However, there is no normal control.

is there any way to analyze?

additionally, i don't have control Methylation(idat)...

i am losing way to analyze.

next-gen RNA-Seq • 926 views

ADD COMMENT • link updated 4.7 years ago by i.sudbery 20k • written 4.7 years ago by 9521ljh ▴ 50

score 0 · Answer 1 · 2020-02-29

You could try to obtain normal / healthy RNA-seq samples by searching GEO, SRA, et cetera. However, my preference would be to process the data as normal (normalisation and transformation), and to then convert the normalised+transformed expression levels to Z-scores. Then, any gene that has Z > 1.96 or Z < -1.96 will be statistically significantly (p<0.05) increased or decreased in expression.

Kevin

score 0 · Answer 2 · 2020-02-29

0

Entering edit mode

4.7 years ago

i.sudbery 20k

The question you are interested in here, is whether each somatic mutation has an effect on the expression of the gene in question. Thus, in each case you DO have two groups of samples - wildtype samples, and mutant samples. The fact they are all cancer is immaterial.

Thus, what you are doing here is an eQTL study, rather than a differential expression study. Unfortunately 5 samples is a very small number of samples to do this with. The ideal would be to fit a limma/edgeR/deseq model for each mutation, dividing into those that have the mutation and those that don't, and check if the gene in question is differential. But if your list of mutations is more than a few, then this is going to get very tiresome, and unless you do some trickery with recalculating the mutliple testing correction, the multiple testing is going to absolutely wipe you out.

Alternatively you could perform a variance stablaising transform, like VST, rLog or (i think) voom, and then for each somatic mutation, take the estimates for that gene, divide to mutant and wildtype samples, and do a t-test.

Finally, if each mutation only occurs in one sample (which I guess is fairly likely), do something like what @Kevin Blighe suggested, but use the other 4 samples to calculate mean and SD on variance stabilized estimates. Although, again, you might struggle with only 5 samples.

ADD COMMENT • link 4.7 years ago by i.sudbery 20k

0

Entering edit mode

Hi,

I have somewhat similar question. I am an wet lab person taking my first steps in the RNAseq field. I have one SNP of interest and I have performed DGE analysis with edgeR to find genes differently expressed based on the genotype. I have detected only genes in a very close proximity to the SNP itself. Now I'm preparing the abstract and would like to use the correct terminology :) Is my edgeR analysis gripped by the selected genotype differential expression analysis or is it in fact the eqtl analysis?

Thanks!

ADD REPLY • link 4.6 years ago by anna.cot.anna.cot • 0

0

Entering edit mode

Normally you would say that the SNP is an eQTL for the gene, rather than vice-versa.

ADD REPLY • link 4.6 years ago by i.sudbery 20k

0

Entering edit mode

Yes, definitely! But if I am performing the Anova like analysis where my dependable variable is genome wide gene expression and independable variable is my SNP is it then called then differential gene expression analysis or is it eQTL analysis?

ADD REPLY • link 4.6 years ago by anna.cot.anna.cot • 0