Differential expression analysis of RNA-Seq data in different species
9.8 years ago
jgrant ▴ 10

Hello,

I am fairly new to this sort of analysis, and have a question - I am picking up a project where they are comparing expression in two fairly closely related nematode genera. We have RNASeq (Illumina) data from the two taxa. When the project was begun, only one of the worms had a published genome. They used Tophat/Cufflinks with that genome to analyze the RNASeq data for both genera.

It seems, at least to my novice eyes, that using the reference genome of one taxon to analyze reads from both could be problematic. Is Tophat able to map reads from one taxon to the genome of another correctly?

There is now a genome for the second taxon, and I can use that to map the reads in Tophat, but how, then, would I compare the output of the two taxa, to find genes that are differentially expressed? Is it possible to build a reference that combines the genomes of both taxa, and somehow pulls orthologs that are found in both?

Any help is appreciated!

Jessica

rna-seq • 2.8k views
9.8 years ago

In my opinion, the ideal course of action would be to characterize each transcriptome independently and establish groups of genes that appear to be expressed at different levels within each genome.

Then compare these groups of genes with one another, either by the similarity of their sequences or by their known functions.

The other options are far less attractive because aligning data from one species to the genome of the other may produce more severe artifacts.

That being said, most of the time success in publishing depends on the strength of the results and your ability to come up with an interesting finding, so if one method does not seem to work, explore the other.
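
As a rough illustration of the "analyze each species separately, then compare" idea, here is a minimal Python sketch. It assumes you already have length-normalized expression values per gene for each species (for example FPKM from Cufflinks) and a table of one-to-one ortholog pairs from some orthology tool. The gene names, values, function name and pseudocount are all hypothetical, and a log2 ratio like this is only a descriptive summary, not a statistical test.

```python
import math


def compare_orthologs(expr_a, expr_b, orthologs, pseudocount=1.0):
    """Return {(gene_a, gene_b): log2 ratio} for ortholog pairs seen in both tables.

    expr_a, expr_b: dicts mapping gene ID -> length-normalized expression.
    orthologs: iterable of (gene_a, gene_b) one-to-one ortholog pairs.
    pseudocount: added to both values to avoid dividing by zero (an assumption).
    """
    ratios = {}
    for gene_a, gene_b in orthologs:
        # Skip pairs where either gene has no expression estimate.
        if gene_a not in expr_a or gene_b not in expr_b:
            continue
        ratio = (expr_a[gene_a] + pseudocount) / (expr_b[gene_b] + pseudocount)
        ratios[(gene_a, gene_b)] = math.log2(ratio)
    return ratios


# Hypothetical example data, not taken from any real experiment.
expr_a = {"A_g1": 7.0, "A_g2": 0.0, "A_g3": 15.0}
expr_b = {"B_g1": 1.0, "B_g2": 3.0}
orthologs = [("A_g1", "B_g1"), ("A_g2", "B_g2"), ("A_g3", "B_g9")]

log2_ratios = compare_orthologs(expr_a, expr_b, orthologs)
for pair, value in sorted(log2_ratios.items()):
    print(pair, round(value, 2))
```

Pairs with a missing partner (here `A_g3`/`B_g9`) are simply dropped, which is one possible choice; you may prefer to report them separately.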


Thank you! Am I correct in thinking there isn't a gold standard method for this sort of analysis? I was looking for papers, but didn't see much out there...


Frankly there is no gold standard even for straight up RNA-Seq analysis - there are only methods that are acceptable (today) yet do not guarantee finding the ground truth.

More to the point, there can't really be a standard analysis for your case, as the dissimilarity between species can be extremely variable and unexpected. I would try both methods, see which works best, and then make a case for it.


Thanks for your input!

9.8 years ago
Manvendra Singh ★ 2.2k

For your first question, I think you should follow Istvan Albert's advice.

For cross-taxa mapping, you can:

Map the reads to both genomes (the two nematode taxa).

When estimating gene expression in each species, keep only those reads that mapped to both genomes.

You may also want to restrict this to uniquely mapped reads, or to one alignment per read, among the reads that mapped to both genomes.

This way the bias from insertions and deletions between the genomes is reduced and, moreover, the cross-taxa mapped reads give you orthologous regions.
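
The read-filtering step above could be sketched roughly as follows in Python. This assumes the alignments to each genome are available as SAM text lines and relies only on the standard SAM columns (read name, FLAG, MAPQ). The function name, the example reads and the `min_mapq` cutoff are hypothetical; what MAPQ value means "uniquely mapped" depends on the aligner, so check its documentation before choosing one.

```python
def confidently_mapped_reads(sam_lines, min_mapq=1):
    """Return the set of read names with a mapped, primary alignment
    whose MAPQ is at least min_mapq (cutoff is an assumption)."""
    names = set()
    for line in sam_lines:
        if line.startswith("@"):  # header line
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        flag = int(fields[1])
        mapq = int(fields[4])
        if flag & 0x4:  # read is unmapped
            continue
        if flag & 0x100 or flag & 0x800:  # secondary or supplementary alignment
            continue
        if mapq < min_mapq:
            continue
        names.add(fields[0])
    return names


# Hypothetical alignments of the same reads to genome A and genome B.
sam_a = [
    "@HD\tVN:1.6",
    "r1\t0\tchrI\t100\t50\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",        # unmapped in A
    "r3\t0\tchrI\t200\t0\t4M\t*\t0\t0\tACGT\tIIII",  # MAPQ 0 in A
]
sam_b = [
    "r1\t16\tscaf1\t5\t50\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t0\tscaf2\t19\t50\t4M\t*\t0\t0\tACGT\tIIII",
    "r3\t0\tscaf2\t9\t50\t4M\t*\t0\t0\tACGT\tIIII",
]

# Reads that pass the filter in both genomes.
shared = confidently_mapped_reads(sam_a) & confidently_mapped_reads(sam_b)
print(sorted(shared))
```

In practice the lines could come from `samtools view` run on each BAM file, and the resulting read names could then be used to subset the alignments before estimating expression.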
