Question

Extract Transcript sequence from fastq file (Plant RNA-Seq data)

0

Entering edit mode

5.4 years ago

hafiz.talhamalik ▴ 350

I have a plant RNA-Seq data and I want to extract the transcript sequence of the some transcripts. How can I do that ??

RNA-Seq • 1.7k views

ADD COMMENT • link updated 5.4 years ago by Kristoffer Vitting-Seerup ★ 4.1k • written 5.4 years ago by hafiz.talhamalik ▴ 350

1

Entering edit mode

@ hafiz.talhamalik

Is this plant genome already sequenced and annotated? If so, go for reference based assembly (with reference sequence) and post alignment, you can extract transcripts (sequence) of interest using annotation file. For aligning RNAseq data, use splice aware aligners such as hisat2, star.

ADD REPLY • link 5.4 years ago by cpad0112 21k

0

Entering edit mode

yes it's known one Brassica Napus. I tried reference based assembly using trinity but assembly results were not good..!! any other good assembler ??

ADD REPLY • link 5.4 years ago by hafiz.talhamalik ▴ 350

2

Entering edit mode

I guess Trinity is for de novo assembly. Since reference genome and annotation are available, use reference based assembly. Authors in this paper (on B. napus transcriptome) https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-015-2062-7 used tophat. I would suggest hisat2 or star. You can use reference genome and annotation from here: https://plants.ensembl.org/Brassica_napus/Info/Index @ hafiz.talhamalik

ADD REPLY • link 5.4 years ago by cpad0112 21k

0

Entering edit mode

You can extract transcript sequences directly from ensembl plants biomart, you don't need RNA-seq data for that. In case it is transgenic with fused transcripts or something like that, then you can use blast to extract reads from fastq file and then assemble them.

ADD REPLY • link 5.4 years ago by ashish ▴ 680

0

Entering edit mode

ok let me try that.. and any idea of tool for annotation of plant assembly ?

ADD REPLY • link 5.4 years ago by hafiz.talhamalik ▴ 350

0

Entering edit mode

MAKER-P is good.

ADD REPLY • link 5.4 years ago by ashish ▴ 680

score 0 · Answer 1 · 2019-11-12

0

Entering edit mode

5.4 years ago

Kristoffer Vitting-Seerup ★ 4.1k

If you have RNAseq you can also do a guided transcriptome assembly - basically look for transcripts in your data that does not appear in the annotation database - you can read more about considerations and tools here.

ADD COMMENT • link 5.4 years ago by Kristoffer Vitting-Seerup ★ 4.1k