statistical analyses for transcriptomes
0
0
Entering edit mode
5.7 years ago
luzglongoria ▴ 50

hi there,

I need to do some statistical analyses of my transcriptomes.

I have a database with 4 columns (gene ID (categoric), expression level (numeric), individual, species). I have 2 different species and 5 ind per species. Per each individual I have more than 20000 genes (some of them are more expressed than others).

What I want to know is whether is there differences between the expression level between species. The distribution of my data doesn't follow a Gaussian distribution.

For analysing my data I run:

 wilcox.test(Exp~Species, data =data)

and then,

Wilcoxon rank sum test with continuity correction
data:  Exp by Species
W = 8573700000, p-value < 2.2e-16
alternative hypothesis: true location shift is not equal to 0

According to these result there should be a significan difference in the expression level between species.BUT:

  1. I am not sure if the analyses are appropiate for this dataset.
  2. Is there any way where I can take into account (as a random factor) the ID gene?

Thank you so much in advance

RNA-Seq R Statistical analyses • 1.1k views
ADD COMMENT
2
Entering edit mode

The distribution of my data doesn't follow a Gaussian distribution.

It probably follows the negative binomial. This is normal and expected. Check out the common differential analysis pipelines, such as DESeq2, edgeR or limma/voom. All are well-documented.

ADD REPLY
0
Entering edit mode

which type of expression data you have (rna-seq, microarray, etc...) ?

ADD REPLY
0
Entering edit mode

Sorry, I didn't say. It is RNA-seq

ADD REPLY
1
Entering edit mode

I think you are looking for differential expression analysis. Check the Bioconductor 2018 Workshop chapter 6 and 7 for more details.

ADD REPLY

Login before adding your answer.

Traffic: 2604 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6