Getting human-specific regions of the human genome
1
0
Entering edit mode
3.9 years ago
arsala521 ▴ 50

Hi everyone,

I have a list of human genomic regions (bed file enlisting genomic coordinates). From that list I want to extract those regions which are human-specific. I want to ask what can be appropriate pipeline for that.

I found one study using this pipeline: Using liftOver utility for converting the coordinates to mouse genome (mm9), then those not converted to the marmoset genome (CalJac3), and again those not converted to the chimp genome (PanTro2) to finally get human-specific regions.

Should I follow this, or could there be some alternate or more efficient way?

Thanks in advance

human-specific regions • 1.1k views
ADD COMMENT
0
Entering edit mode

I think you are looking at mappability to identify unique regions. You can take a look at this recent paper.

ADD REPLY
0
Entering edit mode

Thank you. Just looked into this paper. What I understood is, it is about identifying unique regions within a single genome.

ADD REPLY
0
Entering edit mode

From that list I want to extract those regions which are human-specific

This is a little tricky because of the "orthology" That Alex mentions below. There will always be some sequence similarity between humans and our close relatives (monkeys, mice etc) so there may be no true human-specific regions that are coding.

ADD REPLY
0
Entering edit mode

Something like 96% of human and chimpanzee sequence is identical, by one measure. This may be a very difficult problem, without more detail in the question.

ADD REPLY
0
Entering edit mode

Thank you for the guide. I will look into more details.

ADD REPLY
0
Entering edit mode
3.9 years ago

Starting with orthologous genes ("orthologs"), you might take the subset of human genes which do not overlap or contain those orthologs, by some threshold of base or percentage overlap.

ADD COMMENT
0
Entering edit mode

Thank you, but I have more intergenic regions in my starting list.

ADD REPLY
0
Entering edit mode

Perhaps perform a BLAST nucleotide search between genomes. High-scoring regions would presumably share similarity and may be orthologous. You could also incorporate phyloP or phastcons conservation scores over your intergenic regions, looking for lower-than-background scoring regions as further evidence of speciation among primate alignment.

ADD REPLY
0
Entering edit mode

Okay, thank you for the guide. I will look into these details.

ADD REPLY

Login before adding your answer.

Traffic: 1891 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6