I downloaded the trembl database from uniprot website. But it does not seem to contaln all the protein sequences. I am wondering which database contains all entries (RefSeq?) and how to download a fasta of it.
I downloaded the trembl database from uniprot website. But it does not seem to contaln all the protein sequences. I am wondering which database contains all entries (RefSeq?) and how to download a fasta of it.
You need to clarify what you mean by 'all protein sequences'. Which ones are missing from trEMBL? In which species?
If you genuinely are after all RefSeq sequences then you're not going to find them in trEMBL. Try RefSeq at NCBI: http://www.ncbi.nlm.nih.gov/refseq/
You can download the protein (or nucleotide) sequences in FASTA format for vertebrates and other eukaryotic species from Ensembl:
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.