Entering edit mode
4.2 years ago
mag
▴
20
Dear all, I need a chromosome's basepair start and basepair end points (genetic locations) or in other words genome coordinates (hg19) from the UCSC genome browser, can anyone help me? how can I find it?
Thanks
Thank you @emilyprice, I couldn't see all start and endpoints. Is that link list all the points? Here is some part of the list from the link,
chr1 249250621
chr2 243199373
chr3 198022430
....
chr6_ssto_hap7 4928567
chr6_mcf_hap5 4833398
The start is 1 and the end is the number next to the chromosome name.
The start is actually zero; the convention is 0-based, half-open.
Thanks Alex. This is why I have problem with this:) :
What UCSC visualizes in its browser uses a 1-based index, because most of us have ten digits on our hands and that makes counting of things more "natural" in a visual rendering.
What UCSC publishes as datasets are usually (not always, but generally) zero-based, half-open, which helps make distance calculations done via scripts or programs more "natural".
When you want to know how long something is, or how far away it is from something else, it is usually easier to subtract a pair of numbers, than it is to subtract and then add one base to correct the result.