How to get lncRNA seqences from an annotation file?
1
0
Entering edit mode
8.0 years ago

Hi, I'm very new to bioinformatics in general so please excuse my ignorance. I'm trying to get sequences identified as long noncoding RNA in the mouse genome. I have a gtf file of only lncRNA annotations from Gencode, how do I get the actual sequences using this file? Any help would be greatly appreciated. Thank you.

RNA-Seq genome • 2.5k views
ADD COMMENT
0
Entering edit mode

try gffread

gffread -w transcripts.fa -g /path/to/genome.fa transcripts.gtf

ADD REPLY
0
Entering edit mode

are you analysing lncRNA from RNAseq? by using hisat or tophat?

ADD REPLY
0
Entering edit mode
8.0 years ago
Satyajeet Khare ★ 1.6k
  1. Download reference genome in fasta format (same version as gtf file)
  2. Use bedtools getfasta as follows

    bedtools getfasta -fi reference_genome.fa -bed lncRNA.gtf -fo lncRNA.fa

ADD COMMENT

Login before adding your answer.

Traffic: 1891 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6