Ideas or advise for data visualization that shows genotype and phenotpye relationship
3
2
Entering edit mode
10.5 years ago
kevinjspring ▴ 20

I thought biostars may be a good place to ask this question.

I have a large database of similar protein sequences with one to multiple amino acid substitutions and a phenotypic value for each of these protein sequences. What would be a good way to visualize the relationship between the single or multiple amino acid substitutions and its effect on phenotypic change? Currently I have a graph such as below but it does not capture the protein sequence.

figures data visualization • 3.2k views
ADD COMMENT
6
Entering edit mode
10.5 years ago
Katie D'Aco ★ 1.1k

What about creating a phylogeny of the protein sequences and color coding the branches by phenotype?

Something like this:

ADD COMMENT
0
Entering edit mode

neat visualization!

ADD REPLY
0
Entering edit mode

I'm totally going to have to find an excuse to make a dendrogram like that. What program did you use?

ADD REPLY
1
Entering edit mode

ummm...I used google image search ;)

I should have credited the source in my answer, here is the blog post it is from: http://blog.phytools.org/2012/11/mapping-reconstructed-value-of.html

The author uses a package called phytools

ADD REPLY
0
Entering edit mode

Thanks Katie. It is written in R too.

ADD REPLY
2
Entering edit mode
10.5 years ago

Draw two graphs, one of top of the other, at the same width:

  1. A sequence logo showing the amino acid sequence; substitutions will be visualized as lower information or higher entropy residues.
  2. A bar or violin plot showing measured phenotypic change at each residue, aligned to the residues in the sequence logo.

Repeat for each amino acid sequence; tile the figures if they can be put into a group.

ADD COMMENT
0
Entering edit mode

This doesn't seem to work with phenotypic measurements on proteins with multiple mutations.

ADD REPLY
1
Entering edit mode
10.5 years ago

You may also be able to use the "genotype network" formalism, creating a separate network for each phenotype. In a genotype network, a vertex represents a sequence and edges connect vertices if their corresponding sequences are separated by a single, small mutation. For more information on this formalism, see Wagner (2008) Nature Reviews Genetics, 9:965-974. For the foundations of this formalism, see Schuster et al. (1994) Proc. R. Soc. Lond. B 255:279-284.

ADD COMMENT

Login before adding your answer.

Traffic: 2499 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6