Hi,
Does anyone of a script to convert a gff3 file into a format that PROmer's mapview can accept for a bacterial genome? mapview's expectation is for eukaryotic genomes, and so it's looking for a gff file like this
D_melanogaster_2Rslice Dpseudo initial-exon 16918 16938 . + . 2R_CG14752.1
D_melanogaster_2Rslice Dpseudo internal-exon 17371 17432 . + . 2R_CG14752.1
D_melanogaster_2Rslice Dpseudo last-exon 17913 18168 . + . 2R_CG14752.1
My gff3 looks like this, which mapview is understandably not accepting.
BX571856.1 EMBL gene 36132 36371 . + . locus_tag=SAR0026
BX571856.1 EMBL CDS 36132 36368 . + 0 locus_tag=SAR0026;note=No significant database matches. Doubtful CDS;transl_table=11;product=hypothetical protein;protein_id=CAG39054.1;db_xref=GI:49240408;db_xref=UniProtKB%2FTrEMBL:Q6GKR9;exon_number=1
BX571856.1 EMBL start_codon 36132 36134 . + 0 locus_tag=SAR0026;note=No significant database matches. Doubtful CDS;transl_table=11;product=hypothetical protein;protein_id=CAG39054.1;db_xref=GI:49240408;db_xref=UniProtKB%2FTrEMBL:Q6GKR9;exon_number=1
BX571856.1 EMBL stop_codon 36369 36371 . + 0 locus_tag=SAR0026;note=No significant database matches. Doubtful CDS;transl_table=11;product=hypothetical protein;protein_id=CAG39054.1;db_xref=GI:49240408;db_xref=UniProtKB%2FTrEMBL:Q6GKR9;exon_number=1
I could extract all the lines like the first, but what I don't know how to do is persuade mapview to accept something that doesn't have the initial-exon, internal-exon,last-exon structure (incidentally, how are single-exon genes represented in this scheme?)
I guess I'm not the first to try use mummer/promer/mapview etc for bacterial work so there must be a solution out there somewhere? Sure hope so - I'm kindof new to this so would be happy to hear suggestions for how to go about this. Thanks M