Entering edit mode
8.7 years ago
moustafa_abohawya
▴
30
I have a list of Arabidopsis Gene ID, I am interested to generate serine-threonine- tyrosine counts of residues in each one? Is there a way to do that for a large set of genes?
It is my first time to look for such aspect! I know the question may sound naive a bit!
Thanks in advance
I would use this approach. You can either download the genes of interest in FASTA format (or similar) or use a tool to automatically retrieve a sequence from NCBI (or another database) using the ID and calculate that. But BioPerl/Biopython both have sequence parsing methods that you can build on to do the counting.