I'm working on a workflow tool to help with PCR assay sequence testing and I'd like to understand a bit more about how these three sequences are generated.
My tool takes assay sequences and a query term and compares the assay sequences against a query performed on the NCBI nucleotide database and then outputs an html report highlighting mismatches, including a consensus obtained on the query data using Biopython's motifs consensus method.
Is the consensus useful in generating the assay sequences? Are the primers typically unchanging or rarely changing sections of the organism's DNA?
I'm just a hobby programmer working on a project for a friend, feel free to point me somewhere else if this forum is inappropriate for this kind of simple question.
Thanks!
It depends a bit what kind of assay you're developing.
For a diagnostic qPCR or sequencing task for (e.g.) a particular genetic disease, you'd often want to target unchanging regions of the DNA so that the primers work as widely as possible, but such that they span the region where the variant has appeared.
The probe is simply just part of the sequence that the primers amplify so that it binds internally.
If you were generating a diagnostic for an infectious disease instead for example, you might not be able to rely on the above approach because they mutate so much faster. In this case, you might use a somewhat degenerate primer such that mismatches don't cause a failure to amplify.
These are very sweeping statements though, so it very much depends on the specifics of the assay.
Thanks for your answer. I'm going to have to look up a lot of these terms, haha.
My test case (to see that the workflow tool performs as required) is human RSV. I'm guessing my friend wants to use it for infectious diseases as he mentioned RSV and some bacteria as well.
How do those differ from testing for a genetic disease? Does it matter if it's virus or bacteria, or is it related to the size/rate of mutation of the organism? Other factors?
An experimental biology focused reddit or Biology stackexchange may be the better forums for these questions.
Thanks, I'll look for some appropriate forums
Microorganisms evolve much faster than eukaryotes etc so your choice of primers are even more critical for them.
That said, there are (at least in bacteria) some relatively static sections such as ribosomal sequences (this is the basis of tools like 16S profiling and MultiLocus Sequence Typing (MLST). If you want to read up, MLST would be a good place to start.
Choice of primers will matter for technical reasons (e.g. primer pairs and probes ideally need to have similar binding temperatures (a.k.a melting temperatures).
Is this something you're building for your own interest or for a practical purpose? Not to rain on your parade but there is a lot of important biological subtlety to these types of assays/workflows and mature tools do already exist.
I have a friend whose work includes testing assays for microorganisms. He currently has to do these steps by hand:
He wanted a way to automate all three steps into one tool, so I'm doing that for him using his feedback and test case (he provided the query term and primer/probe sequences).
What my tool does is the following:
I'm just using python to string together the steps of his work using existing tools. Because I am not a subject matter expert, it's just a fun project for me that I hope is a useful tool that can save some time in my friend's workflow.
Rain on my parade in this case is perhaps encouraged since I know so little about the subject.
Thanks!