Hello,
I have a file containing a lot of SNP in a tabular format with the chromosome and the position on the chromosome and the allele change
here is an exemple
chr1 2898298 rsxxxxxx A T
The file contain thousands of data like this
I want to extract flanking region of each snp and export both sequences in fasta format for each SNP with both allele, for my example one with A and one sequence with T flanked by 25 bp from each side
I want to do it with UCSC not Ensembl, I browsed biostar and I found how to connect to UCSC via MySQL. Can someone help me to design the query using mysql, I will wrap it in a perl script and try to export both sequences
Regards
Patrick
what is wrong with these solutions? Get Flanking Sequence Given A List Of Positions
I'd download the latest human genome release and use bio::db::sam modules' access reference functions to grab the flanking regions. It's very very fast.