Hi,
I have a collection of expression probe sequences, their chromosomes and their coordinates. I wish to identify which genome build my coordinates come from. Is there a simply way to do this?
Thanks, D.
Hi,
I have a collection of expression probe sequences, their chromosomes and their coordinates. I wish to identify which genome build my coordinates come from. Is there a simply way to do this?
Thanks, D.
Assuming that you are talking about human genome. As there is difference between chromosome lengths of hg17, hg18 and hg19, you can check your BED file (chromosome, start and stop) for each chromosome. Example- chromosome1 in hg18 has 247,249,719 bp and hg19 has 249,250,621 bp. If you have any coordinate with chr1 248,250,621 249,150,621 then you can say the probes belong to hg19. You have to do this for all chromosomes.
But the problem is, if your probes are from hg19 but all the coordinates lies in hg18 also.
You can give it a try, and if it is from hg19 and you got some coordinate as I showed in example, your problem will be solved. I really want to know if we have any good solution for this.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
You say you know the sequence of the probe, as well as the coordinates? If so, just paste a few of your coordinates into UCSC and click the "DNA" button up top. If it matches your probe sequence, you have the right build. Else, try the other build.
you know the species, right?
The species is human - I just need to know if they are from NCBI 36 or GRCh37.
Ahh. If you know the probe sequences, then aaron is right. I thought you have only chromosome, start and end position.
@aaronQuinlan - that worked a treat! I am knee deep in UCSC documentation. I have never used this resource before. It's super! If you wish to submit your comment as an answer, i'll mark it correct.