How To Find A 28Bp 'Primer' Sequence In A Genome?
6
4
Entering edit mode
14.7 years ago
Dan ▴ 540

Dumb question I know, but...

What tool to use?

I'd like to find forward and reverse primers within an approximate distance of each other... I'd like to accept 1 or two mismatches. The genome is ~1 Gbp. I have ~500 forward reverse pairs to find in the genome.

Any hints appreciated :-)

sequence genome primer • 9.2k views
ADD COMMENT
2
Entering edit mode

I think you need to be more specific with your question: what are you trying to do? 500 pairs of "primers": are you targeting something in particular or just random? Will these sequences be used for wet lab experiments like PCR or qPCR? Etc.

ADD REPLY
0
Entering edit mode

do you need some random primers or do you know your 500 targets ?

ADD REPLY
0
Entering edit mode

Yes, please be more clear. Do you (1) have 500 forward and reverse 28 bp sequences that you wish to align to the genome sequence? Or (2) want to generate 500 forward/reverse primers using the genome sequence?

ADD REPLY
5
Entering edit mode
14.7 years ago
Darked89 4.7k

Did not used it myself yet, but there is a new program claiming to be better than primer3:

See: http://nar.oxfordjournals.org/cgi/content/abstract/37/13/e95

Download it from: http://pythia.sourceforge.net

ADD COMMENT
0
Entering edit mode

thanks for the links, didn't know about that.

ADD REPLY
4
Entering edit mode
14.7 years ago
Michael 55k

I know that many people use the primer3 software to design primers. There are also two web-interfaces where you can try out the parameters, and there are many of them.

The program eprimer3 is part of the EMBOSS tools, maybe the best way of getting a command-line executable of primer3.

I didn't see a mismatch parameter though, because for a 'real' PCR-primer to allow for mismatches is a bad idea. I noted the '' in your question, so you might not really be looking for primers. So what is it then? If this is just a piece of sequence then you do not need to bother with uniqueness or melting temperature constraints.

ADD COMMENT
3
Entering edit mode
14.7 years ago
brentp 24k

[EDIT] Just saw this today which could be another piece of software to check out. I didn't read it carefully, but it looks like it builds on primer3. [/EDIT]

Since you have so few pairs, you could just run them through a read-aligner allowing mismatches, and find pairs that are aligned within X basepairs in the reference genome.

Actually, you could probably even use the paired-end feature of most aligners so that the aligner only find the pairs with nearby matches. For either of those methods, you could use bowtie.

As far as I know, primer3 does allow gaps and mismatches, so that may also be an option if you just want to see if you have good primers, but you'd probably have to use the command-line version.

or, you could use BLAST and find nearby hits between the pairs.

ADD COMMENT
1
Entering edit mode
14.7 years ago

You could try e-PCR or in-silico PCR. e-PCR is available to run locally.

ADD COMMENT
0
Entering edit mode

So can isPCR, Source code can be found here: http://users.soe.ucsc.edu/~kent/src/

ADD REPLY
1
Entering edit mode
13.4 years ago
Wubin Qu ▴ 170

MFEprimer has an command-line to check the specificity of primers, you can download it from the web site: http://biocompute.bmi.ac.cn/MFEprimer/

ADD COMMENT
0
Entering edit mode

Please use the new server: http://www.mfeprimer.com/

ADD REPLY
1
Entering edit mode
10.0 years ago

(better late than never)

One can use blastn with -task blast-short, and it's quite fast. As explained in the manual (section BLAST+ features), "blastn-short is BLASTN program optimized for sequences shorter than 50 bases". Further down (table C2), it says "blastn-short is optimized for sequences less than 30 nucleotides" and specifies word_size=7 (versus 11 for blastn) and reward=1 (versus 2 for blastn), all the other parameters remaining as for blastn.

In a Unix-like machine, with NCBI BLAST 2.2.26+:

zcat genome.fa.gz | formatdb -i stdin -l formatdb.log -p F -n genome
blastn -query primers.fa -db genome -task blastn-short -out primers_one_genome.txt -outfmt 7

If one knows both primers around the sequence they amplify, I recommend concatenating them into a single sequence, and map it to the genome. BLASTn-short will usually (likely?) return two best hits, one per primer, separated by a gap on the genome.

ADD COMMENT

Login before adding your answer.

Traffic: 2679 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6