When I download transcript data from Refseq via their FTP site (ftp://ftp.ncbi.nlm.nih.gov/refseq/), I noticed that there are two file types: .gbff & .fna. Is it correct to assume that the .gbff (Gene Bank Flat... I believe) file contains EXACTLY the same sequence information as the .fna file (FASTA format sequences) in the same order, except that the .fna file has only short one-line descriptions for the sequences?
Also, what are the possible last 'words' in the ">..." title for each sequence in the .fna file? I've seen for example 'mRNA' and 'ncRNA', and so forth. Is there a fixed and standardized list by chance?