Hi guys,
I'm trying to get something done that sound super simple but I'm stuck and I don't know how to go about it.
I have a large excel file with Primer sequences that someone designed for the human genome and I need to find out which part of the genome they cover. All i have is the sequence for the forward and reverse primer. So ideally I'd like to align the forward and reverse primer to the genome, I only want perfect matches to the genome because the primers have been designed in such a way.
There should be a linux tool to do this right? Maybe Blast, or bowtie? If anyone could guide me in the right direction I would much appreciate that.
Thanks :)
bbmap
has the following modes that might be of use...bowtie
is perfectly fine. I would set all mismatch and penalty parameters to 1000, set seed mismatches to zero etc. to ensure only perfect matches are returned.Thank you for the quick reply, I'll let you know how it works :)
For
bowtie
I think one could use-n 0 -k 1 -m 1 --best --strata -v0
. For bowtie2 I used in the past (but this was for NGS perfect matches not short primers)--end-to-end -N 0 --mp 10000 --np 10000 --rdg 10000 --rfg 10000
. Try things out a bit :)Depending on how many sequences there are just use
in silico
PCR tool: http://genome.ucsc.edu/cgi-bin/hgPcr?db=hg38.You can also use the web interface for
blat
for a more flexible search. http://genome.ucsc.edu/cgi-bin/hgBlat?command=start