Hi there. I am a novice. I need to do comparative genomics. I searched and downloaded several genomes of different genus below one species. Because NCBI only choose one of them served as Refseq. I can only get the annotation file of the Refseq one. And, I can only find the assembled genomes of the rest others with very poor quality annotation files.(just keep "genbank region", "genbank region" and "genbank region", no "exon", "CDS", for example) However, the literatures which they derived from have described and discussed the annotation results. So is it because that I try to search the annotation file in a wrong way? Or I have to ask the authors for annotation files? Actually, I have emailed the authors asking for annotation files. If there is no answer. Does it mean that I have to repeat the annotation work or ortholog analysis that the authors have already done before? Thank you so much if there is an answer or some suggestions.
Thank you for telling me the information of the "RefSeq category" and ways to get the information!!! I need look into pigs. There is a RefSeq, which I am clearly confirm. However, as you say, annotation of the other pig genomes seem chosen computationally. (after I download the gff files from the Genbank, the contents look nonsense, with very poor and rough information.) Some papers told that they have finished annotating the genomes and also discussed much about their details but there seems no data accessibility to the annotation files. So I need to re-annotate the genomes right? For I am not very clear that whether all the annotation files are required to be uploaded. :)
For anything but bacteria I recommend to look at ensembl genomes as well, for example the pig genome there: Sus scrofa When you know there were groups with better annotation, the ensembl is more likely to integrate such annotation for many genomes, with exception of JGI genomes in my experience. It's always worth double checking both resources.
i got it. thank you!