Question

Getting human-specific regions of the human genome

3.9 years ago

arsala521 ▴ 50

Hi everyone,

I have a list of human genomic regions (a BED file listing genomic coordinates). From that list I want to extract those regions which are human-specific. What would be an appropriate pipeline for that?

I found one study that used the following pipeline: use the liftOver utility to convert the coordinates to the mouse genome (mm9); take the regions that fail to convert and try to lift them to the marmoset genome (calJac3); then take the regions that fail again and try to lift them to the chimp genome (panTro2). The regions that fail all three conversions are treated as human-specific.
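For illustration, the sequential filtering described above could be sketched roughly as follows. This is only a sketch under assumptions: the human build (hg19 here) is not stated in the study description, the chain file names are placeholder local paths, and the helper names (`liftover_command`, `strip_comments`, `human_specific`) are hypothetical. It relies on the standard liftOver argument order (`liftOver oldFile map.chain newFile unMapped`) and on the unmapped output containing `#` reason lines that must be removed before the next pass.

```python
import subprocess
from pathlib import Path

# Assumption: regions are on hg19 and these chain files exist locally.
# Both the build and the file names are placeholders, not taken from the study.
CHAINS = [
    ("mm9", "hg19ToMm9.over.chain.gz"),
    ("calJac3", "hg19ToCalJac3.over.chain.gz"),
    ("panTro2", "hg19ToPanTro2.over.chain.gz"),
]


def liftover_command(bed_in, chain, mapped_out, unmapped_out):
    """Build the call: liftOver oldFile map.chain newFile unMapped."""
    return ["liftOver", str(bed_in), str(chain), str(mapped_out), str(unmapped_out)]


def strip_comments(lines):
    """Drop blank lines and the '#...' reason lines found in the unmapped file."""
    return [ln for ln in lines if ln.strip() and not ln.startswith("#")]


def human_specific(bed_in, workdir="."):
    """Lift regions to each genome in turn, carrying forward only the failures."""
    current = Path(bed_in)
    for name, chain in CHAINS:
        mapped = Path(workdir) / f"mapped_{name}.bed"
        raw_unmapped = Path(workdir) / f"unmapped_{name}.raw"
        subprocess.run(
            liftover_command(current, chain, mapped, raw_unmapped), check=True
        )
        kept = strip_comments(raw_unmapped.read_text().splitlines(keepends=True))
        current = Path(workdir) / f"unmapped_{name}.bed"
        current.write_text("".join(kept))
    # Regions that failed to lift to all three genomes.
    return current
```

Calling `human_specific("regions.bed")` (with a hypothetical input file) would leave the candidate regions in `unmapped_panTro2.bed`. Note that the result depends on liftOver's matching threshold, so "failed to convert" is not the same as "no similar sequence exists".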

Should I follow this, or could there be some alternate or more efficient way?

Thanks in advance

human-specific regions • 1.1k views

ADD COMMENT • link updated 3.9 years ago by Alex Reynolds 36k • written 3.9 years ago by arsala521 ▴ 50

I think you are looking at mappability to identify unique regions. You can take a look at this recent paper.

ADD REPLY • link 3.9 years ago by GenoMax 147k

Thank you. I just looked at this paper. As I understand it, it is about identifying unique regions within a single genome.

ADD REPLY • link 3.9 years ago by arsala521 ▴ 50

From that list I want to extract those regions which are human-specific

This is a little tricky because of the orthology that Alex mentions below. There will always be some sequence similarity between humans and our close relatives (monkeys, mice, etc.), so there may be no truly human-specific regions that are coding.

ADD REPLY • link 3.9 years ago by GenoMax 147k

Something like 96% of human and chimpanzee sequence is identical, by one measure. This may be a very difficult problem, without more detail in the question.

ADD REPLY • link 3.9 years ago by Alex Reynolds 36k

Thank you for the guidance. I will look into this in more detail.

ADD REPLY • link 3.9 years ago by arsala521 ▴ 50

score 0 · Answer 1 · 2021-01-05

3.9 years ago

Alex Reynolds 36k

Starting with orthologous genes ("orthologs"), you might take the subset of human genes which do not overlap or contain those orthologs, by some threshold of base or percentage overlap.
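As a rough sketch of that filtering step, assuming both the query regions and the ortholog annotations are already available as (chrom, start, end) tuples in BED-style half-open coordinates: the function names and the `max_fraction` parameter are hypothetical, and regions are assumed to have non-zero length.

```python
from collections import defaultdict


def merge(intervals):
    """Merge overlapping or touching (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def covered_fraction(region, merged_by_chrom):
    """Fraction of a (chrom, start, end) region covered by merged intervals."""
    chrom, start, end = region
    covered = sum(
        max(0, min(end, e) - max(start, s))
        for s, e in merged_by_chrom.get(chrom, [])
    )
    return covered / (end - start)  # assumes end > start


def without_orthologs(regions, orthologs, max_fraction=0.0):
    """Keep regions whose ortholog coverage is at most max_fraction."""
    by_chrom = defaultdict(list)
    for chrom, start, end in orthologs:
        by_chrom[chrom].append((start, end))
    merged = {chrom: merge(ivs) for chrom, ivs in by_chrom.items()}
    return [r for r in regions if covered_fraction(r, merged) <= max_fraction]
```

With the default `max_fraction=0.0` only regions with no overlap at all are kept; raising it tolerates partial overlap. For large files, an interval tool such as BEDOPS or bedtools would likely be a better fit than this pure-Python scan.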

ADD COMMENT • link 3.9 years ago by Alex Reynolds 36k

Thank you, but I have more intergenic regions in my starting list.

ADD REPLY • link 3.9 years ago by arsala521 ▴ 50

Perhaps perform a BLAST nucleotide search between genomes. High-scoring regions would presumably share similarity and may be orthologous. You could also incorporate phyloP or phastCons conservation scores over your intergenic regions, looking for regions that score lower than background as further evidence of divergence within the primate alignment.
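A minimal sketch of the conservation-score idea, assuming the phyloP or phastCons scores have already been exported to bedGraph-style records (chrom, start, end, value), for example with the UCSC bigWigToBedGraph utility, and that a background value has been chosen separately. The function names and the `background` parameter are hypothetical, and the linear scan is only meant for small inputs.

```python
def mean_score(region, scores):
    """Length-weighted mean score over the scored bases inside a region.

    scores: list of (chrom, start, end, value) bedGraph-style records.
    Returns None when no scored base falls inside the region.
    """
    chrom, start, end = region
    total = 0.0
    bases = 0
    for s_chrom, s_start, s_end, value in scores:
        if s_chrom != chrom:
            continue
        overlap = min(end, s_end) - max(start, s_start)
        if overlap > 0:
            total += value * overlap
            bases += overlap
    return total / bases if bases else None


def below_background(regions, scores, background):
    """Return (region, mean) pairs whose mean score is below the background."""
    selected = []
    for region in regions:
        mean = mean_score(region, scores)
        if mean is not None and mean < background:
            selected.append((region, mean))
    return selected
```

Regions without any scored bases are skipped rather than treated as low-scoring, since missing alignment coverage and low conservation are different signals. How the background is defined (genome-wide mean, matched intergenic sample, and so on) is left open here.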