Question

From genomic coordinate to Reference base

0

Entering edit mode

10.2 years ago

Nicola Casiraghi ▴ 500

Hi, I have an extensive set of genomic coordinates (i.e. chr1:9071988) and I need to automatically get the reference (Human Genome hg19 ) base corresponding to that positions.

Could you provide me some hints to solve this issue please?

Thank you in advance!

genome • 3.6k views

ADD COMMENT • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by Nicola Casiraghi ▴ 500

0

Entering edit mode

Thank you everybody for the useful comments and suggestions, really appreciated! samtools faidx and bedtools getfasta work both perfectly for my purpose. Many thanks again!

ADD REPLY • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by Nicola Casiraghi ▴ 500

1

Entering edit mode

10.2 years ago

Alex Reynolds 36k

You could easily script the following DAS lookup for each of your coordinates:

$ wget -qO- http://genome.ucsc.edu/cgi-bin/das/hg19/dna?segment=chr1:9071988,9071988 | grep -v '^<'
c

ADD COMMENT • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by Alex Reynolds 36k

1

Entering edit mode

10.2 years ago

David Langenberger 11k

You can try fastacmd.

Create DB (run this only once):

formatdb -i hg19.fa -o T -p F -V

Get nucleotide:

fastacmd -d hg19.fa -L 9071988,9071989 -s "chr1" (-S 2 if neg. Strang)

ADD COMMENT • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by David Langenberger 11k

Ram · Accepted Answer · 2015-01-22

3

Entering edit mode

10.2 years ago

Devon Ryan 105k

samtools faidx and some shell scripting should work.

ADD COMMENT • link 10.2 years ago by Devon Ryan 105k

0

Entering edit mode

What? Sorry, I don't get it, again. samtools? Why samtools? That makes no sense! ;)

ADD REPLY • link 10.2 years ago by Lars ★ 1.1k

0

Entering edit mode

Samtools can be used to extract subsequences from a fasta file. That's what the faidx command does.

ADD REPLY • link 10.2 years ago by Devon Ryan 105k

0

Entering edit mode

Wow! Didn't know that! That is a nice way of doing that! Thank you for that information!

Just looked it up:

samtools faidx hg19.fasta chr1:9071988,9071988

ADD REPLY • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by Lars ★ 1.1k

1

Entering edit mode

You may also find the bedtools getfasta command useful.

ADD REPLY • link updated 3.1 years ago by Ram 45k • written 10.2 years ago by Devon Ryan 105k