Question

Compute Blastx E-Value

0

Entering edit mode

13.1 years ago

snayfach • 0

Does anyone know how blastx e-values are computed? Is it the same as blastp, except the query sequence is divided by 3? I've tried using E = (m/3)n2^(-S), but I get slightly different results than from the blastx output. I need to compute blastx e-value because my blast database is split up into several parts.

Thanks in advance

blast ncbi • 3.0k views

ADD COMMENT • link updated 11.7 years ago by Biostar 20 • written 13.1 years ago by snayfach • 0

Ram · Answer 1 · 2012-04-17

0

Entering edit mode

13.1 years ago

Bill Pearson ★ 1.1k

I believe the blastx e-value uses 2*m, not m/3, because all 6-frames are included.

ADD COMMENT • link updated 6.4 years ago by Ram 45k • written 13.1 years ago by Bill Pearson ★ 1.1k

Ram · Answer 2 · 2012-04-17

I don't understand why, but this formula actually got me pretty close (well under an order of magnitude difference):

E = (m/6)(n)(2^-S)

-m is the sequence length in nucleotides
-n is the database size in residues
-S is the bitscore

It should be noted that blast performs a "finite size correction" in which a value is subtracted from the query and database sequence length. It's unclear what this value actually is.

http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastNews