Entering edit mode
4.4 years ago
What technique does proteomic software use to predict if separate sequences originate from the same protein?
In some of the proteomics data I have worked with, preprocessing steps included the ability to differentiate peptides representing unique proteins, from (multiple) peptides coming from the same protein. I am wondering what the name is for the technique that does this step of proteomic data processing.
I assume you're talking about mapping peptides?
They are simply mapped back to a known set of reference proteins. If they are sufficiently long, they will map uniquely (assuming that sequence only occurs once in the reference set). Shorter or ambiguity containing sequences might hit multiple reference sites, in which case there are a few different strategies to decide how you count these.