Question

Can someone explain to me in relatively simple language how PCR primer, reverse primer, and probe sequences are generated?

0

Entering edit mode

9 weeks ago

Ryan • 0

I'm working on a workflow tool to help with PCR assay sequence testing and I'd like to understand a bit more about how these three sequences are generated.

My tool takes assay sequences and a query term and compares the assay sequences against a query performed on the NCBI nucleotide database and then outputs an html report highlighting mismatches, including a consensus obtained on the query data using Biopython's motifs consensus method.

Is the consensus useful in generating the assay sequences? Are the primers typically unchanging or rarely changing sections of the organism's DNA?

I'm just a hobby programmer working on a project for a friend, feel free to point me somewhere else if this forum is inappropriate for this kind of simple question.

Thanks!

reverse probe pcr primer • 776 views

ADD COMMENT • link 8 weeks ago by Ryan • 0

score 0 · Answer 1 · 2024-10-21

0

Entering edit mode

9 weeks ago

Joe 21k

It depends a bit what kind of assay you're developing.

For a diagnostic qPCR or sequencing task for (e.g.) a particular genetic disease, you'd often want to target unchanging regions of the DNA so that the primers work as widely as possible, but such that they span the region where the variant has appeared.

The probe is simply just part of the sequence that the primers amplify so that it binds internally.

If you were generating a diagnostic for an infectious disease instead for example, you might not be able to rely on the above approach because they mutate so much faster. In this case, you might use a somewhat degenerate primer such that mismatches don't cause a failure to amplify.

These are very sweeping statements though, so it very much depends on the specifics of the assay.

ADD COMMENT • link 9 weeks ago by Joe 21k

0

Entering edit mode

Thanks for your answer. I'm going to have to look up a lot of these terms, haha.

My test case (to see that the workflow tool performs as required) is human RSV. I'm guessing my friend wants to use it for infectious diseases as he mentioned RSV and some bacteria as well.

How do those differ from testing for a genetic disease? Does it matter if it's virus or bacteria, or is it related to the size/rate of mutation of the organism? Other factors?

ADD REPLY • link 9 weeks ago by Ryan • 0

1

Entering edit mode

Microorganisms evolve much faster than eukaryotes etc so your choice of primers are even more critical for them.

That said, there are (at least in bacteria) some relatively static sections such as ribosomal sequences (this is the basis of tools like 16S profiling and MultiLocus Sequence Typing (MLST). If you want to read up, MLST would be a good place to start.

Choice of primers will matter for technical reasons (e.g. primer pairs and probes ideally need to have similar binding temperatures (a.k.a melting temperatures).

Is this something you're building for your own interest or for a practical purpose? Not to rain on your parade but there is a lot of important biological subtlety to these types of assays/workflows and mature tools do already exist.

ADD REPLY • link 9 weeks ago by Joe 21k

0

Entering edit mode

I have a friend whose work includes testing assays for microorganisms. He currently has to do these steps by hand:

download a relevant set of sequences from NCBI nucleotide database
align his assay sequences with the data he downloaded
check the alignments in BioEdit for mismatches

He wanted a way to automate all three steps into one tool, so I'm doing that for him using his feedback and test case (he provided the query term and primer/probe sequences).

What my tool does is the following:

Take a user defined query term and fetch the relevant sequences in fasta format from NCBI's nucleotide database using Biopython's Entrez module
using clustalo, align the downloaded sequences with each of the primers and probe sequences
align the downloaded data set and get a consensus using Biopython's motifs module
compile a list of mismatches between the assays and the downloaded sequences
generate an html format report
save the report, alignments, and query in a timestamped report folder for review by the user

I'm just using python to string together the steps of his work using existing tools. Because I am not a subject matter expert, it's just a fun project for me that I hope is a useful tool that can save some time in my friend's workflow.

Rain on my parade in this case is perhaps encouraged since I know so little about the subject.

Thanks!

ADD REPLY • link 9 weeks ago by Ryan • 0

1

Entering edit mode

I would suggest taking a look at tools like Primer3 (there is a biopython module for this I believe), which is built specifically for designing and optimising primers from query sequences and will guard against many of the biological pitfalls.

It may even allow you to do the websearch itself, but if not, entrez is the right way to go.

As a note for your friend, I would also be surprised if there aren't already published primer sets for clinically significant pathogens like RSV, so he too may be reinventing the wheel, but perhaps he's ahead of me here.

ADD REPLY • link 8 weeks ago by Joe 21k

0

Entering edit mode

Thanks for the suggestions. RSV has just been the test case to make sure the script works as expected. He hasn't asked me to help him with designing the primers or probe so I haven't looked into it at all. Really, it's just reducing the amount of manual busy work he has to do which I guess includes checking whether primers are still valid for a given pathogen. Really appreciate all the replies here at Biostar!

Edit: I believe the primers/probe he gave me for testing are published ones, where testing means, 'does the script produce the expected results', not trying to create new primers or probes or anything.

ADD REPLY • link 8 weeks ago by Ryan • 0