How do N's in DNA sequences affect multi-sequence alignments for PCR assay testing?

0

Entering edit mode

11 weeks ago

Ryan • 0

I'm writing a workflow tool for testing PCR assay primer/probe sequences. I am not a bioinformatics person or scientist, I'm writing this tool to help a friend.

I am using a search term to download some sequences from the NCBI nucleotide database. Currently, I am removing any sequence that contains at least one N in the sequence record.

My question is, how do N's affect a multi sequence alignment? Is there an acceptable threshold of N's for sequences that will be aligned with a primer/probe sequence to test PCR assays?

My friend wasn't quite sure how we should handle sequences containing N's so I thought I'd ask here.

Thanks!

multi-sequence PCR alignment • 504 views

ADD COMMENT • link 11 weeks ago by Ryan • 0

1

Entering edit mode

Depending on the program you are using solitary N's may not have deleterious effect on your testing. You may want to use curated sequences like RefSeq which should not have this problem.

ADD REPLY • link 11 weeks ago by GenoMax 148k

0

Entering edit mode

I am using clustal omega to perform the alignments. My friend is required to use genbank so I believe I'm stuck with it for this tool

ADD REPLY • link 11 weeks ago by Ryan • 0

1

Entering edit mode

RefSeq is actually curated subset of GenBank. The sequences will be non-redundant and will save you time/effort: https://ncbi.nlm.nih.gov/refseq/about/