The Role Of Non-Conserved Regions In Homologous Genes
1
2
Entering edit mode
12.1 years ago
Justin ▴ 470

Let's consider a hypothetical situation where I have a gene called G in humans that does function F and its homolog in mouse is gene H. On the whole, they are not conserved (represented by dashes) but there's a small portion that is conserved (represented by nucleotides) and we find that the conserved region is a binding site for a protein.

Gene G (human):  ------------------------ACCCGATCGATCGCAGT----------------
Gene H (mouse):  ------------------------ACCCGATCGATCGCAGT----------------

With that in mind, I have two questions:

  • 1) Starting from G, how do you find that H is a homolog in mouse?
    • For example, how is it done at the HomoloGene database or at Ensembl when they present you homologs of a gene? Is that done experimentally where they see the genes perform similar function in different species?

  • 2) What could be the role of the nonconserved regions?
    • Could the non-conserved sequences look utterly different yet share some 2D structure, or do you still need at least some sequence conservation? What kind of programs would be best suited to look at that?
    • Could the non-conserved region be different because you need different sequences in order to implement the same function in different species?
• 5.5k views
ADD COMMENT
2
Entering edit mode

Homologous structures are similar because of common descent not because of functional similarity (analogous), if they are similar at all. Sequence similarity is an indicator for common descent, not proof. Functional similarity is no proof for common descent.

ADD REPLY
0
Entering edit mode

Thanks, got confused with homologous and analogous.

ADD REPLY
9
Entering edit mode
12.1 years ago
VS ▴ 740

For your first question, here is the link as to how Ensembl predicts homologs. In summary, the steps involved are --

  1. Load longest translation of each gene from both species
  2. Run Blast against each other
  3. Generate clusters based on Blast scores
  4. Make multiple alignments of seqs within clusters
  5. From each cluster, build phylogenetic tree and infer orthology relationships

Your second question is very broad but as for the points-- Yes it is possible to have similar protein structure even when sequence is not wholly conserved and also to have different protein structure for similar sequences! I am not sure about 2D level.

The roles of non-conserved regions in orthologs can be myriad. There can be several orthologs in a species relative to one in another species( see one2many, many2one and many2many relationships in Ensembl link above). These extra orthologs arise by duplication and with evolutionary time they will diverge (rise of non-conserved regions) to adopt a new function (neo-functionalization), become dysfunctional (pseudogene) or sometimes each ortholog retains a part of original function or adopts a tissue specific expression (sub-functionalization).

ADD COMMENT

Login before adding your answer.

Traffic: 2538 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6