Question

How To Find A 28Bp 'Primer' Sequence In A Genome?

4

Entering edit mode

14.8 years ago

Dan ▴ 540

Dumb question I know, but...

What tool to use?

I'd like to find forward and reverse primers within an approximate distance of each other... I'd like to accept 1 or two mismatches. The genome is ~1 Gbp. I have ~500 forward reverse pairs to find in the genome.

Any hints appreciated :-)

sequence genome primer • 9.2k views

ADD COMMENT • link updated 14 months ago by Ram 44k • written 14.8 years ago by Dan ▴ 540

2

Entering edit mode

I think you need to be more specific with your question: what are you trying to do? 500 pairs of "primers": are you targeting something in particular or just random? Will these sequences be used for wet lab experiments like PCR or qPCR? Etc.

ADD REPLY • link 14.8 years ago by Nicojo ★ 1.1k

0

Entering edit mode

do you need some random primers or do you know your 500 targets ?

ADD REPLY • link 14.8 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

Yes, please be more clear. Do you (1) have 500 forward and reverse 28 bp sequences that you wish to align to the genome sequence? Or (2) want to generate 500 forward/reverse primers using the genome sequence?

ADD REPLY • link 14.8 years ago by Neilfws 49k

Ram · Answer 1 · 2010-03-08

5

Entering edit mode

14.8 years ago

Darked89 4.7k

Did not used it myself yet, but there is a new program claiming to be better than primer3:

See: http://nar.oxfordjournals.org/cgi/content/abstract/37/13/e95

Download it from: http://pythia.sourceforge.net

ADD COMMENT • link updated 14 months ago by Ram 44k • written 14.8 years ago by Darked89 4.7k

0

Entering edit mode

thanks for the links, didn't know about that.

ADD REPLY • link 14.8 years ago by brentp 24k

Ram · Answer 2 · 2010-03-07

I know that many people use the primer3 software to design primers. There are also two web-interfaces where you can try out the parameters, and there are many of them.

The program eprimer3 is part of the EMBOSS tools, maybe the best way of getting a command-line executable of primer3.

I didn't see a mismatch parameter though, because for a 'real' PCR-primer to allow for mismatches is a bad idea. I noted the '' in your question, so you might not really be looking for primers. So what is it then? If this is just a piece of sequence then you do not need to bother with uniqueness or melting temperature constraints.

Ram · Answer 3 · 2010-03-07

[EDIT] Just saw this today which could be another piece of software to check out. I didn't read it carefully, but it looks like it builds on primer3. [/EDIT]

Since you have so few pairs, you could just run them through a read-aligner allowing mismatches, and find pairs that are aligned within X basepairs in the reference genome.

Actually, you could probably even use the paired-end feature of most aligners so that the aligner only find the pairs with nearby matches. For either of those methods, you could use bowtie.

As far as I know, primer3 does allow gaps and mismatches, so that may also be an option if you just want to see if you have good primers, but you'd probably have to use the command-line version.

or, you could use BLAST and find nearby hits between the pairs.

Ram · Answer 4 · 2010-03-26

1

Entering edit mode

14.7 years ago

Madelaine Gogol 5.3k

You could try e-PCR or in-silico PCR. e-PCR is available to run locally.

ADD COMMENT • link updated 14 months ago by Ram 44k • written 14.7 years ago by Madelaine Gogol 5.3k

0

Entering edit mode

So can isPCR, Source code can be found here: http://users.soe.ucsc.edu/~kent/src/

ADD REPLY • link 14.0 years ago by Casey Bergman 18k

Ram · Answer 5 · 2011-07-14

1

Entering edit mode

13.4 years ago

Wubin Qu ▴ 170

MFEprimer has an command-line to check the specificity of primers, you can download it from the web site: http://biocompute.bmi.ac.cn/MFEprimer/

ADD COMMENT • link updated 14 months ago by Ram 44k • written 13.4 years ago by Wubin Qu ▴ 170

0

Entering edit mode

Please use the new server: http://www.mfeprimer.com/

ADD REPLY • link 7.4 years ago by Wubin Qu ▴ 170

Ram · Answer 6 · 2014-11-20

(better late than never)

One can use blastn with -task blast-short, and it's quite fast. As explained in the manual (section BLAST+ features), "blastn-short is BLASTN program optimized for sequences shorter than 50 bases". Further down (table C2), it says "blastn-short is optimized for sequences less than 30 nucleotides" and specifies word_size=7 (versus 11 for blastn) and reward=1 (versus 2 for blastn), all the other parameters remaining as for blastn.

In a Unix-like machine, with NCBI BLAST 2.2.26+:

zcat genome.fa.gz | formatdb -i stdin -l formatdb.log -p F -n genome
blastn -query primers.fa -db genome -task blastn-short -out primers_one_genome.txt -outfmt 7

If one knows both primers around the sequence they amplify, I recommend concatenating them into a single sequence, and map it to the genome. BLASTn-short will usually (likely?) return two best hits, one per primer, separated by a gap on the genome.