Differential expression analysis without a reference genome
5.2 years ago

Dear all,

I am doing a differential expression analysis of RNA-seq data with no reference genome. After running the Trinity assembly to get Trinity.fasta, the next step should be align_and_estimate_abundance.pl, which maps the reads from each sample to the Trinity.fasta file. My question is: do we need to run cd-hit first to remove redundancy from Trinity.fasta, and only then run align_and_estimate_abundance.pl?

Thank you!

RNA-Seq • 1.1k views
5.2 years ago
h.mon 35k

"But the question is that is we need to do cd-hit to remove the redundancy for the Trinity.fasta first"

One could, but there are probably better ways to reduce redundancy in the assembly; have a look at the Trinity FAQ. I would first consider:

Using the SuperTranscripts method, which produces a genome-like, gene-level representation of the transcriptome assembly. You can then follow up with Differential Transcript Usage via SuperTranscripts.

You can also use the Trinity.fasta.gene_trans_map generated by Trinity to get "gene" counts in addition to the transcript counts.

Filtering out transcripts with low counts after quantifying transcript abundance (this can be applied in conjunction with the two methods above).
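
To make the last two points concrete, here is a minimal Python sketch, not Trinity's own implementation. It assumes the gene_trans_map is a two-column, tab-separated file (gene ID, then transcript ID) and that per-sample transcript counts are already loaded into a dict. The function names, the toy IDs and counts, and the thresholds `min_count` and `min_samples` are all hypothetical and chosen only for illustration; pick cutoffs that suit your own data.

```python
import csv
import io


def load_gene_trans_map(handle):
    """Read a two-column, tab-separated map into {transcript_id: gene_id}.

    Assumes column 1 is the gene ID and column 2 is the transcript ID.
    """
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) >= 2:
            gene_id, transcript_id = row[0], row[1]
            mapping[transcript_id] = gene_id
    return mapping


def sum_to_genes(transcript_counts, transcript_to_gene):
    """Sum per-sample transcript counts over all isoforms of each gene."""
    gene_counts = {}
    for transcript_id, counts in transcript_counts.items():
        # Transcripts missing from the map are kept under their own ID.
        gene_id = transcript_to_gene.get(transcript_id, transcript_id)
        previous = gene_counts.get(gene_id, [0] * len(counts))
        gene_counts[gene_id] = [a + b for a, b in zip(previous, counts)]
    return gene_counts


def filter_low_counts(counts_by_feature, min_count=10, min_samples=2):
    """Keep features with >= min_count reads in >= min_samples samples.

    Both thresholds are illustrative assumptions, not recommended values.
    """
    return {
        feature: counts
        for feature, counts in counts_by_feature.items()
        if sum(c >= min_count for c in counts) >= min_samples
    }


# Toy input: made-up IDs and counts for three samples.
toy_map = io.StringIO(
    "TRINITY_DN1_c0_g1\tTRINITY_DN1_c0_g1_i1\n"
    "TRINITY_DN1_c0_g1\tTRINITY_DN1_c0_g1_i2\n"
    "TRINITY_DN2_c0_g1\tTRINITY_DN2_c0_g1_i1\n"
)
toy_counts = {
    "TRINITY_DN1_c0_g1_i1": [120, 95, 130],
    "TRINITY_DN1_c0_g1_i2": [3, 0, 5],
    "TRINITY_DN2_c0_g1_i1": [1, 0, 2],
}

transcript_to_gene = load_gene_trans_map(toy_map)
gene_counts = sum_to_genes(toy_counts, transcript_to_gene)
kept_transcripts = filter_low_counts(toy_counts)
kept_genes = filter_low_counts(gene_counts)

print(gene_counts)
print(sorted(kept_transcripts))
print(sorted(kept_genes))
```

With a real dataset you would read the map from the Trinity.fasta.gene_trans_map file and the counts from your quantification output instead of the toy values. Filtering works the same way on transcript-level or gene-level counts, which is why it combines with either of the other two approaches.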

Thanks for the helpful reply!!
