Can anyone please help me with handling Genbank .gbf files? Recently I've generated Sequin (.sqn) files and Genbank (.gbf) files which I don't know what to do with them to obtain the accession IDs of the translated nucleotide sequences such that I know the names of the proteins identified. the .gbf files look something like this when I open with Notepad:
LOCUS Scaffold1 1325603 bp DNA linear 14-FEB-2013
DEFINITION No definition line found.
ACCESSION
VERSION
KEYWORDS .
SOURCE Unknown.
ORGANISM Unknown.
Unclassified.
FEATURES Location/Qualifiers
source 1..1325603
/organism="unknown"
/mol_type="genomic DNA"
gene complement(<1..555)
/locus_tag="asmbl_1"
CDS complement(<1..555)
/locus_tag="asmbl_1"
/codon_start=1
/transl_table=11
/product="tat (twin-arginine translocation) pathway signal
sequence domain protein"
/translation="MKEFHSTLSRRDFMKSLGVVGAGLGTMSAAAPVFHDLDEVTSST
LGINKNPWWVKERDFKNPTVPIDWSKVTRQPGVFQGLPRPTVADFTKAGVVGGTSTDL
ETPEMALTLYDAMAKEFPGWTPGYAGMGDTRTTALCNASKFMMFGAWPGNMEMGGKRV
NVIGAIMAAGGSPTFTPWLGPQLDT"
...
...
Does anyone here know of a software tool which I can use to make sense out of these and generate accession IDs for them?
Thank you in advance!
Not clear: this is a new .gbf, generated by you for some new sequence data? In which case there will not be any accessions or IDs; that happens after submission to GenBank and curation.