how to select longest isoform per gene in gtf
2
0
Entering edit mode
7.6 years ago
reza ▴ 300

hi everyone

i downloaded a gff file for a vertebrate and converted it to gtf format using gffread. i want to use the gtf file to creat database for SnpEff. Now i am going to choose and maintain single isoform per gene (longest isoform) in gtf file.

anyone can advise me to do this?

thanks in advance

gene GTF annotation • 7.3k views
ADD COMMENT
5
Entering edit mode
7.6 years ago
A. Domingues ★ 2.7k

Using the cgat toolset it can be accomplished with:

cgat gtf2gtf --method=filter --filter-method=longest-transcript 

It is a very rich set of tools, so have a look at what else it does because it will make your like easier.

ADD COMMENT
0
Entering edit mode
7.6 years ago
EagleEye 7.6k

Hi, the following script is used to get longer gene/isoform (start and end) from GTF. You can modify this script to solve your task,

ADD COMMENT

Login before adding your answer.

Traffic: 1554 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6