How To Obtain Intron And Utr Info From Gff File?
0
1
Entering edit mode
13.0 years ago
Flo ▴ 10

Hi, I have a gff file like this: (example for one gene)

scaffold_1      JGI     start_codon     6757    6759    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"
scaffold_1      JGI     exon    6972    7023    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     6972    7023    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 2
scaffold_1      JGI     exon    7099    7169    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     7099    7169    .       +       1       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 3
scaffold_1      JGI     exon    7306    7344    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     7306    7344    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 4
scaffold_1      JGI     exon    8316    8366    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     8316    8366    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 5
scaffold_1      JGI     exon    8741    8785    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     8741    8785    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 6
scaffold_1      JGI     exon    9166    9207    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     9166    9207    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 7
scaffold_1      JGI     exon    9458    9514    .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     9458    9514    .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 8
scaffold_1      JGI     exon    10143   10229   .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     10143   10229   .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 9
scaffold_1      JGI     exon    10388   10432   .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     10388   10432   .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 10
scaffold_1      JGI     exon    10864   11308   .       +       .       name "estExtDG_fgenesh_newKGs_kg.C_10006"; transcriptId 431653
scaffold_1      JGI     CDS     10864   10947   .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"; proteinId 431653; exonNumber 11
scaffold_1      JGI     stop_codon      10945   10947   .       +       0       name "estExtDG_fgenesh_newKGs_kg.C_10006"

I would like to recover the utr and intron positions from this. Any easy way to do this?

Thanks!

intron gff utr • 6.5k views
ADD COMMENT
1
Entering edit mode

Are the UTRs annotated in your file? I don't see any in column 3 of your example. If not, you cannot recover them; you can only retrieve intergenic positions.

ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode

Yes I'm trying with genometools but can't get the utrs Any idea?

ADD REPLY

Login before adding your answer.

Traffic: 1959 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6