Question

Identifying full length transcripts in a transcriptome

0

Entering edit mode

8.9 years ago

EVR ▴ 610

Hi,

Is there any way to find full length and in-complete transcripts that has been assembled from a de novo transcriptome?

Any guidance would be appreciated.

Thanks in advance

de-novo-Transcriptome full-length-transcripts • 2.7k views

ADD COMMENT • link updated 2.3 years ago by Ram 44k • written 8.9 years ago by EVR ▴ 610

2

Entering edit mode

The CEGMA benchmark (PMID: 17332020) evaluates how many core eukaryotic genes have been assembled partially or full length. A similar tool seems to be BUSCO. I am not sure if this can be generalized beyond core single-copy genes.

ADD REPLY • link updated 4.9 years ago by Ram 44k • written 8.9 years ago by trausch ★ 1.9k

0

Entering edit mode

Do you mean a way to calculate de length of sequences in a multifasta file?

ADD REPLY • link 8.9 years ago by iraun 6.2k

0

Entering edit mode

I suspect Tom instead wants a way to see if the assembled contigs are complete/full or not.

ADD REPLY • link 8.9 years ago by Devon Ryan 104k

0

Entering edit mode

Like Devon Ryan said, I want to find whether assembled contigs in a the transcriptome is complete or missing some parts? Can we BLAST the query transcript and check and how good is the coverage for the best target hit. And later add the missing region from the target best hit which is not present in query? is my approach is right? Kindly guide me

ADD REPLY • link 8.9 years ago by EVR ▴ 610

score 1 · Answer 1 · 2016-03-18

1

Entering edit mode

8.7 years ago

JC 13k

Run Triannotate https://trinotate.github.io/ and check how many are complete or not (the fasta header has a tag for that)

ADD COMMENT • link 8.7 years ago by JC 13k

0

Entering edit mode

You mean to say "Transdecoder" cos trinotate annotates whiel transdecoder predicts the protein coding transcripts based on ORF

ADD REPLY • link 8.7 years ago by EVR ▴ 610