Biological Explanation Of Sequence Based Features : Dipeptides
1
0
Entering edit mode
12.5 years ago
User 1933 ▴ 360

I am wondering, if there is any biological explanation of dipeptides or k-mers ?! what chemo-physical properties they potentially have and how one can interpret them ? For example, if in a protein property prediction, the model comes up with a list of dipeptides, as a predictive features, what they really mean ?!

Also, I have the same question regarding CTD (Composition, Transition and Distribution) of amino acids introduced by this paper

sequence prediction feature • 2.4k views
ADD COMMENT
0
Entering edit mode
12.5 years ago
Cliff Beall ▴ 480

They don't have biological relevance per se, except that biological sequences carry information and they are ways to break down that information to achieve things computationally - like using kmers to align, bin or assemble DNA sequences.

Of course each amino acid has certain chemical properties, so dipeptides also have certain properties.

As far as the paper goes, I wouldn't pay too much attention to 17 year old papers on protein folding - I'm not an expert, but I am pretty sure the field is in a totally different place.

ADD COMMENT
0
Entering edit mode

Since it is always interesting to come up with an biological story about the computational investigation results, may I ask you, to elaborate a little bit more about, how one can get more insight about the ultimate property of proteins with respect to kmers/dipeptides ?

ADD REPLY
0
Entering edit mode

This would take too much space for this forum, there are 20 amino acids with different properties (so 400 dipeptides) - you can find more info in a general Biochemistry textbook like here: http://www.ncbi.nlm.nih.gov/books/NBK21154/

ADD REPLY
0
Entering edit mode

I am asking about biological explanation of Dipeptides :)

ADD REPLY

Login before adding your answer.

Traffic: 1833 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6