Simple question - I want the names of all the genes that have a tyrosine residue in their respective amino acid sequence.
Bonus points - limit that to only genes having a tyrosine that is able to be phosphorylated by a tyrosine kinase (I understand these recognize some kind of motif, not just any old tyrosine).
Background: I'm trying to do a GO-style hypergeometric overrepresentation analysis of significantly differentially expressed proteins that are pulled down with an anti-phosphotyrosine antibody. An appropriate background set would not be every gene in the genome - only those containing a phosphorylatable tyrosine.
http://gps.biocuckoo.org/links.php has a lot of resources. Not sure how out of date they are though.