Hi,
Im trying to retrieve the Human Uniprot/Trembl release 2014_09 as a fasta file, which is supposed to contain about 86.000 sequences.
Unfortunately, when I look into previous releases in the uniprot ftp server, I only find the extraordinary large .dat file containing all sequences of all species. So that I can't even parse out the human entries without a memory error.
When I use the "date of" filter of the uniprot web interface, I find more than 86.000 unreviewed sequences. And I am not sure which starting date I am supposed to choose. However choosing from 01.01.2002 to 01.09.2014 already results in more than 100.000 unreviewed and only 10.000 reviewed sequences.
Is there a way to access this release in an easy way?
Thanks for your help, Leon