Hello everyone, this is my problem:
I have a list of SNPs coordinates in this format (from chr 1 to chr Y, ~580k SNPs), they are under build GRCh37:
chr1:108681808
chr1:109440678
chr1:109479801
chr1:110655430
chr1:11193226
chr1:113933669
chr1:115258741
chr1:115527488
...
And I have the Reference Genome build37 in .fa format
>chr1
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNN....gctttatacaatctat
ttgttactttttattctattttgcatttt
gttcctttgcctgaataattcactttggt
ctgcaatggctaattgcaatagattt...
Is there tool to obtain the sequence corresponding to each given coordinate?
chr1:108681808 A
chr1:109440678 C
chr1:109479801 T
chr1:110655430 G
chr1:11193226 A
chr1:113933669 T
chr1:115258741 C
chr1:115527488 C
Hope you can help me
exact duplicate of
getting sequence based on chromosome no and coordinates from whole genome fasta file