Which conservation score to use to measure conservation of splice sites?
5.5 years ago

I would like to measure the conservation of a particular subset of human splice sites and compare it with that of all other splice sites and of matched sites that have a canonical splice-site sequence but are not spliced. Which scores should I use? UCSC has both phastCons and phyloP scores for hg38 from alignments of 100 vertebrates, and there is also GERP (although unfortunately UCSC doesn't have GERP for hg38). I'm not sure which of these would be best.

compartive-genomics conservation • 3.1k views

I used phyloP scores to build a 'risk score' algorithm in the past, purely because the scores are intuitive: they are measured as negative log10 p-values (I believe), with positive scores meaning more conserved and negative scores meaning less conserved (faster-evolving than expected under neutrality). I used the scores as priors in a Bayesian regression model for the variants of interest, with the prior mean set to the mean phyloP score for all bases across the genome. This had the effect of 'adjusting' the p-values and odds ratios derived from the model.
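To make the sign convention concrete, here is a minimal sketch that turns a signed phyloP score back into a p-value and a direction. It assumes the score is a signed -log10 p-value under a neutral-evolution null, as described above; the function name `phylop_to_pvalue` and the direction labels are only illustrative.

```python
def phylop_to_pvalue(score):
    """Convert a signed phyloP score to (p_value, direction).

    Assumes |score| = -log10(p) under a null of neutral evolution,
    with the sign giving the direction of the departure.
    """
    p_value = 10 ** (-abs(score))
    if score > 0:
        direction = "conserved"      # slower than neutral
    elif score < 0:
        direction = "accelerated"    # faster than neutral
    else:
        direction = "neutral"
    return p_value, direction


# Example: a score of 2 corresponds to p = 0.01 in the conserved direction.
p, direction = phylop_to_pvalue(2.0)
print(p, direction)
```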

5.5 years ago
Ian 6.1k

The following text is from UCSC's page on PhastCons / PhyloP (LINK). Given how short splice sites are, I think the fine-grained phyloP score might be the most useful.

"PhastCons is a hidden Markov model-based method that estimates the probability that each nucleotide belongs to a conserved element, based on the multiple alignment. It considers not just each individual alignment column, but also its flanking columns. By contrast, phyloP separately measures conservation at individual columns, ignoring the effects of their neighbors. As a consequence, the phyloP plots have a less smooth appearance than the phastCons plots, with more "texture" at individual sites. The two methods have different strengths and weaknesses. PhastCons is sensitive to "runs" of conserved sites, and is therefore effective for picking out conserved elements. PhyloP, on the other hand, is more appropriate for evaluating signatures of selection at particular nucleotides or classes of nucleotides (e.g., third codon positions, or first positions of miRNA target sites)."
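Because phyloP is a per-base score, one way to use it here is to build a per-position profile around each group of sites and then compare groups. Below is a rough sketch under stated assumptions, not a definitive pipeline: `site_profile`, `permutation_pvalue`, and the `get_score` callable are hypothetical names. In practice `get_score` might wrap a reader for the phyloP bigWig file, but here it is just any function mapping `(chrom, pos)` to a score or `None`. The permutation test is my own choice of comparison and is not something the quoted text prescribes.

```python
import random
from statistics import mean


def site_profile(sites, get_score, flank=3):
    """Mean score at each offset (-flank..+flank) around a set of sites.

    sites: iterable of (chrom, pos, strand) with 0-based positions.
    get_score: hypothetical callable (chrom, pos) -> float or None.
    Offsets are strand-aware, so +1 is always one base downstream
    in the direction of transcription.
    """
    per_offset = {off: [] for off in range(-flank, flank + 1)}
    for chrom, pos, strand in sites:
        for off in per_offset:
            genomic = pos + off if strand == "+" else pos - off
            score = get_score(chrom, genomic)
            if score is not None:  # skip bases with no alignment data
                per_offset[off].append(score)
    return {off: (mean(v) if v else None) for off, v in per_offset.items()}


def permutation_pvalue(a, b, n_perm=10000, seed=0):
    """Two-sided permutation test on the difference in mean score."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

The same two functions can be run on each of the three groups in the question (the subset of interest, all other splice sites, and unspliced matched sites), comparing scores offset by offset or at the invariant dinucleotide only.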

5.5 years ago

@Emily Ensembl has pointed out that Ensembl has hg38 GERP scores here: ftp://ftp.ensembl.org/pub/current_compara/conservation_scores/88_mammals.gerp_conservation_score/
