5.1 years ago
hafiz.talhamalik
I have plant RNA-Seq data and I want to extract the sequences of some transcripts. How can I do that?
@ hafiz.talhamalik
Is this plant genome already sequenced and annotated? If so, go for reference-based assembly (with the reference sequence); after alignment, you can extract the transcript sequences of interest using the annotation file. For aligning RNA-seq data, use splice-aware aligners such as hisat2 or star.
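To illustrate the "extract transcripts using the annotation file" step, here is a minimal sketch in plain Python. It assumes the genome is a FASTA file and the annotation is a GTF with `exon` features carrying a `transcript_id` attribute; the function names are made up for this example, and it is not a replacement for a dedicated tool.

```python
from collections import defaultdict

# Complement table used for transcripts on the minus strand.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def read_fasta(lines):
    """Parse FASTA lines into a dict {sequence_id: sequence}."""
    seqs, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                seqs[name] = "".join(chunks)
            # Keep only the first word of the header as the sequence ID.
            name = line[1:].split()[0]
            chunks = []
        else:
            chunks.append(line)
    if name is not None:
        seqs[name] = "".join(chunks)
    return seqs


def read_exons(gtf_lines):
    """Collect (chrom, start, end, strand) for each exon, keyed by transcript_id."""
    exons = defaultdict(list)
    for line in gtf_lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        # GTF attributes look like: gene_id "g1"; transcript_id "t1";
        for part in fields[8].split(";"):
            key, _, value = part.strip().partition(" ")
            if key == "transcript_id":
                tid = value.strip().strip('"')
                exons[tid].append((fields[0], int(fields[3]), int(fields[4]), fields[6]))
    return exons


def transcript_sequence(genome, exon_list):
    """Join exon sequences in genomic order; reverse-complement minus-strand transcripts."""
    ordered = sorted(exon_list, key=lambda e: e[1])
    # GTF coordinates are 1-based and inclusive.
    seq = "".join(genome[chrom][start - 1:end] for chrom, start, end, _ in ordered)
    if ordered[0][3] == "-":
        seq = seq.translate(COMPLEMENT)[::-1]
    return seq
```

With real files you would pass open file handles, e.g. `read_fasta(open("genome.fa"))` and `read_exons(open("annotation.gtf"))` (both file names are placeholders), then call `transcript_sequence(genome, exons[some_id])` for each transcript of interest.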
Yes, it's a known one, Brassica napus. I tried reference-based assembly using Trinity, but the assembly results were not good. Is there any other good assembler?
I guess Trinity is for de novo assembly. Since a reference genome and annotation are available, use reference-based assembly. The authors of this paper (on the B. napus transcriptome) https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-015-2062-7 used tophat. I would suggest hisat2 or star. You can get the reference genome and annotation from here: https://plants.ensembl.org/Brassica_napus/Info/Index @ hafiz.talhamalik
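As a rough sketch of the hisat2 route suggested above, the helper below only builds the two command lines (index building, then paired-end alignment) as Python lists. The helper name, the file names, and the thread count are placeholders for illustration, and paired-end reads are an assumption; check the hisat2 manual for the options that fit your library.

```python
def hisat2_commands(genome_fasta, index_prefix, reads_1, reads_2, out_sam, threads=4):
    """Return (build_cmd, align_cmd) as argument lists; nothing is executed here."""
    # hisat2-build <reference_fasta> <index_prefix>
    build_cmd = ["hisat2-build", genome_fasta, index_prefix]
    # hisat2 -p <threads> -x <index_prefix> -1 <mates_1> -2 <mates_2> -S <output.sam>
    align_cmd = [
        "hisat2",
        "-p", str(threads),
        "-x", index_prefix,
        "-1", reads_1,
        "-2", reads_2,
        "-S", out_sam,
    ]
    return build_cmd, align_cmd
```

If hisat2 is installed, each list can be run with `subprocess.run(cmd, check=True)`. For single-end reads, hisat2 takes `-U <reads>` instead of `-1`/`-2`.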
You can extract transcript sequences directly from Ensembl Plants BioMart; you don't need RNA-seq data for that. If the plant is transgenic with fused transcripts or something like that, you can use blast to pull the matching reads from the fastq file and then assemble them.
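If you would rather download the whole cDNA FASTA and filter it locally instead of using the BioMart web form, a small sketch like this pulls out only the transcripts you want. It assumes the transcript ID is the first word of each FASTA header; the function name and the IDs in the usage note are placeholders.

```python
def select_fasta_records(fasta_lines, wanted_ids):
    """Return {transcript_id: sequence} for FASTA records whose ID is in wanted_ids."""
    wanted = set(wanted_ids)
    selected = {}
    current = None  # ID of the record being collected, or None if it is skipped
    for line in fasta_lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            rec_id = parts[0] if parts else ""
            current = rec_id if rec_id in wanted else None
            if current is not None:
                selected[current] = []
        elif current is not None:
            selected[current].append(line)
    return {rec_id: "".join(chunks) for rec_id, chunks in selected.items()}
```

For example, `select_fasta_records(open("cdna.fa"), ["tx1", "tx2"])` would return the sequences of those two records, where `cdna.fa`, `tx1` and `tx2` stand in for your own file and transcript IDs.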
OK, let me try that. Any idea of a tool for annotating a plant assembly?
MAKER-P is good.