Question

No variant in the Pseudoautosomal regions of gomad chrY ?

1

Entering edit mode

13 days ago

Pierre Lindenbaum 166k

Hi all,

There is no variant in the PAR regions of the chrY in gnomad.

$ wget -qO - https://storage.googleapis.com/gcp-public-data--gnomad/release/4.1/vcf/genomes/gnomad.genomes.v4.1.sites.chrY.vcf.bgz |\
bcftools view --no-header --targets "chrY:10001-2781479,chrY:56887903-57217415" |\
wc -l


0

and on my side, unless I'm wrong, I don't have any variant in those regions using my WGS data (contains males+females). why ? is it highly conserved ? do you have any reference please ?

chrY PAR vcf gnomad Pseudoautosomal • 509 views

ADD COMMENT • link 13 days ago by Pierre Lindenbaum 166k

score 3 · Accepted Answer · 2025-04-27

GPT says the following. Until we get an answer this seems logical.

There are actually variants in the pseudoautosomal regions (PARs) of the human genome, but they’re relatively fewer and behave differently than variants in other parts of the genome. Here's why that might seem like there are "no variants":

1. High Sequence Identity Between X and Y

PAR regions are identical (or nearly identical) between the X and Y chromosomes. This makes it technically difficult to map sequencing reads accurately to the right chromosome in these regions, especially with short-read sequencing. As a result, many variants may be missed or misassigned.

2. Recombination Keeps Them Homogenized

PAR regions are subject to regular recombination during male meiosis—just like autosomes. This keeps the sequences in these regions more homogenized between the X and Y chromosomes, which reduces the accumulation of unique variants over time.

3. Strong Purifying Selection

Because PAR genes are often dosage-sensitive (they're expressed from both sex chromosomes, unlike other X-linked genes that undergo X-inactivation), deleterious variants are more strongly selected against, reducing variation.

4. Reference Bias and Database Gaps

Some variant databases and genome builds underrepresent or underreport variants in PARs because:

They exclude Y-PAR variants.
They collapse PAR sequences into a single representation.
Or they filter them out due to uncertainty in read mapping.