chrY in 1000G vs hg19
12.1 years ago
Gabriel R. ★ 2.9k

I compared the Y chromosome in hg19 with the one in 1000G, and the two seem to differ despite having the same number of characters. Has anybody else noticed this? Why do they differ?

genome chromosome • 4.8k views

You should link to the source of the data in each case so we can look at it. However, hg19 is a consensus sequence, whereas 1000G comprises sequences from many individuals. So it's not surprising that they differ, since the goal of 1000G is precisely to understand variation. There is in fact no single "Y chromosome in 1000G".


OK, now I see that you are referring to the reference sequences used by the 1000G project.


You should point out at least one base-pair difference to support your argument. As far as I know, they are the same. EDIT: I was wrong. They are different. We should use the 1000G genome if possible.

12.1 years ago
Neilfws 49k

1000G uses sequences from Ensembl (see README at location in your FTP link).

It seems that Ensembl has a slightly different procedure for inserting N into the sequence scaffolds. The issue is discussed in this mailing list thread.
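To see where two same-length chrY sequences actually differ, one option is to compare them position by position and group the mismatches into runs. The following is only a minimal sketch, not anything taken from the 1000G or UCSC tooling: the function names and the run labels are made up for illustration, and it assumes each FASTA file holds a single record. Sequences are upper-cased first, since UCSC files are soft-masked in lowercase and case alone should not count as a difference.

```python
from itertools import groupby


def read_fasta_sequence(path):
    """Return the upper-cased sequence of a single-record FASTA file."""
    parts = []
    with open(path) as handle:
        for line in handle:
            if not line.startswith(">"):
                parts.append(line.strip().upper())
    return "".join(parts)


def classify(a, b):
    """Label one aligned position; None means the two bases agree."""
    if a == b:
        return None
    if a == "N":
        return "N_only_in_a"
    if b == "N":
        return "N_only_in_b"
    return "base_mismatch"


def differing_runs(seq_a, seq_b):
    """Return (start, end, kind) for each maximal run of differing positions.

    Coordinates are 0-based and end-exclusive.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    runs = []
    pos = 0
    for kind, group in groupby(map(classify, seq_a, seq_b)):
        length = sum(1 for _ in group)
        if kind is not None:
            runs.append((pos, pos + length, kind))
        pos += length
    return runs


# Toy example; for real data, pass the output of read_fasta_sequence()
# for each of the two chrY files (file names are up to you).
toy_runs = differing_runs("ACGTNNNNACGT", "ACGTACGTACGA")
print(toy_runs)  # [(4, 8, 'N_only_in_a'), (11, 12, 'base_mismatch')]
```

If the difference really comes down to how N is inserted, the output should consist mostly of long `N_only_in_a` or `N_only_in_b` runs rather than scattered `base_mismatch` entries.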


Editing my own comment: I see. I made the build 36 version of the genome for 1000G, and at that time there was this difference. My colleague later told me that UCSC switched to the Ensembl way as of hg19, but UCSC still keeps the pseudoautosomal regions on chrY. This is the wrong decision. I would discourage using the UCSC genome for mapping.
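For readers who want to see what hiding the pseudoautosomal regions on chrY could look like, here is a rough sketch that overwrites chosen intervals with N while keeping the sequence length unchanged. It assumes hard-masking with N is the intended treatment, which would be consistent with the two files having the same number of characters. The function name is hypothetical and the intervals in the example are toy values, not real PAR coordinates.

```python
def hard_mask(sequence, intervals):
    """Return a copy of sequence with each interval replaced by N.

    Intervals are (start, end) pairs, 0-based and end-exclusive.
    """
    masked = list(sequence)
    for start, end in intervals:
        if not 0 <= start <= end <= len(sequence):
            raise ValueError(f"interval ({start}, {end}) is out of range")
        masked[start:end] = "N" * (end - start)
    return "".join(masked)


# Toy intervals standing in for the two PARs at the chromosome ends.
print(hard_mask("ACGTACGTAC", [(0, 3), (8, 10)]))  # NNNTACGTNN
```

With the regions masked on chrY, reads from those regions can only align to the chrX copy instead of being split ambiguously between the two chromosomes.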


Thank you for digging out that post!
