Question

Which conservation score to use to measure conservation of splice sites?

3

Entering edit mode

5.8 years ago

i.sudbery 21k

I would like to measure the conservation of a particular subset of human splice sites and compare them to all other splice sites and matched sites that have a canonical splice-site sequence, but are not spliced. Which scores should I use? UCSC has both phasCons and PhyloP scores for hg38 from alignments of 100 vertebrates, and there is also GERP (although unfortunately UCSC doesn't have GERP for hg38). I'm not sure I understand which of these would be best.

compartive-genomics conservation • 3.5k views

ADD COMMENT • link updated 2.3 years ago by Ram 45k • written 5.8 years ago by i.sudbery 21k

2

Entering edit mode

I used phyloP scores to build a 'risk score' algorithm in the past, purely because the scores are intuitive and measured as negative log10 p-values (I believe), with positive meaning more conserved and negative meaning less conserved. I used the scores as priors in a Bayesian regression model for the variants of interest, with the mean prior being the mean phyloP score for all bases across the genome. This had the effect of 'adjusting' the derived p-values and odds ratios from the model.

ADD REPLY • link 5.8 years ago by Kevin Blighe 89k

score 3 · Answer 1 · 2019-06-13

The following text is from UCSC's page on PhastCons / PhyloP (LINK). I think for the size of splice-sites that the fine grained PhyloP score might be most useful.

"PhastCons is a hidden Markov model-based method that estimates the probability that each nucleotide belongs to a conserved element, based on the multiple alignment. It considers not just each individual alignment column, but also its flanking columns. By contrast, phyloP separately measures conservation at individual columns, ignoring the effects of their neighbors. As a consequence, the phyloP plots have a less smooth appearance than the phastCons plots, with more "texture" at individual sites. The two methods have different strengths and weaknesses. PhastCons is sensitive to "runs" of conserved sites, and is therefore effective for picking out conserved elements. PhyloP, on the other hand, is more appropriate for evaluating signatures of selection at particular nucleotides or classes of nucleotides (e.g., third codon positions, or first positions of miRNA target sites)."

score 2 · Answer 2 · 2019-06-13

2

Entering edit mode

5.8 years ago

i.sudbery 21k

@Emily Ensembl has point out that Ensembl has hg38 GERP scores here: ftp://ftp.ensembl.org/pub/current_compara/conservation_scores/88_mammals.gerp_conservation_score/

ADD COMMENT • link 5.8 years ago by i.sudbery 21k

score 0 · Answer 3 · 2022-12-07

0

Entering edit mode

2.3 years ago

BaDoi Phan • 0

There's an updated version of the ENSEMBL GERP score in hg38 coordinates at ftp://ftp.ensembl.org/pub/current_compara/conservation_scores/91_mammals.gerp_conservation_score/gerp_conservation_scores.homo_sapiens.GRCh38.bw

ADD COMMENT • link 2.3 years ago by BaDoi Phan • 0