Entering edit mode
8.7 years ago
RT!37
▴
20
From a group of paralogs, how to select representative member in order to prevent overrepresentation in my dataset.
From a group of paralogs, how to select representative member in order to prevent overrepresentation in my dataset.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
What do you mean by "overrepresentation"? If there is a chance that they have functionally evolved then you will need to consider them all unless you know a duplication is recent and has not had time to evolve.
Thanks
Hi, From over-representation I mean , being a slight difference in sequence but similar function proteins occurrence multiple times. For my study, i want to exclude such proteins. The duplication event is recent or early can be known only by looking divergence of function (neofunctionalization). But since large number of proteins are there in dataset, manually selecting based on function is not an option. Is there any resource or threshold value to decide the representative member from a group of paralogs proteins.