Hi, I was wondering if it is possible to download a random sample of proteins from a given protein database. I want to compare my proteins of interest to "background proteins", i.e. a control. Probably a little trickier would be to download proteins that aren't of a certain type, e.g. non-membrane proteins.
Has anyone done anything like this? I see in papers all the time "we used non-XXX proteins as a negative training set", and I'd imagine something like this would be a pain to do manually.
Ideally I would not like to download entire databases, but rather do this task online.
Anyone done this sort of thing?
What do you mean by downloading a protein?
AFAIK it is hard to transport amino acids over HTTP.
Sorry, my bad: I meant FASTA files.
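For the random-sampling part, one possible sketch (assuming the sequences can be read as a FASTA stream, for example piped from a download rather than saved to disk) is reservoir sampling, which keeps only k records in memory at a time. The function names `read_fasta` and `reservoir_sample` are hypothetical, and the optional `keep` filter that works on header text is only a crude stand-in for a real annotation-based "non-XXX" selection.

```python
import random
from typing import Callable, Iterable, Iterator, List, Optional, Tuple

Record = Tuple[str, str]  # (header without '>', sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header: Optional[str] = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def reservoir_sample(
    records: Iterable[Record],
    k: int,
    keep: Optional[Callable[[Record], bool]] = None,
    seed: Optional[int] = None,
) -> List[Record]:
    """Return up to k records chosen uniformly at random from the stream.

    Records for which keep(record) is False are skipped before sampling.
    """
    rng = random.Random(seed)
    sample: List[Record] = []
    seen = 0  # number of records that passed the filter so far
    for rec in records:
        if keep is not None and not keep(rec):
            continue
        seen += 1
        if len(sample) < k:
            sample.append(rec)
        else:
            # Replace a random slot with probability k / seen.
            j = rng.randrange(seen)
            if j < k:
                sample[j] = rec
    return sample
```

It could be used as `reservoir_sample(read_fasta(sys.stdin), k=100, keep=lambda r: "membrane" not in r[0].lower())` after `import sys`. Filtering on the header text is an assumption about how the database labels its entries, so it is worth checking against proper annotations before using the result as a negative set.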
Hi Pierre, thanks for this. I'm trying it out now but getting "ERROR 2003 (HY000): Can't connect to MySQL server on 'genome-mysql.cse.ucsc.edu' (113)".
Probably a firewall issue with my campus so I'll let you know how I get on.
You should edit your question and add something like "Edit 1" at the end with your progress, rather than posting it as an answer. Or post a new question if that really doesn't work.
This should be a comment, not an answer. And yes, it is a problem with the firewall.