Hello all,
I'm using Plastx (a faster equivalent of Blastx) to compare my 500k contigs against the NCBI database in order to check for suspected unknown contamination. I would like to keep only the results with a percentage identity > 95%. However, there is no "percentage identity" column in the output file.
I do have some other fields, such as HSP_identity, intensity and alignment, so I was wondering whether it is possible to calculate this percentage myself.
Thanks !
Thanks for your help ! I'm a bit confused: I specified -maxhsps 1 in my command, yet I sometimes get values > 100 in the HSP_identity column, especially when the HSP e-value is low.
EDIT : I found the answer to my question: in plast, percentage identity = 100 × HSP_identity / HSP_align_length, so HSP_identity is a count of identical positions rather than a percentage, which is why it can exceed 100. The result is sometimes slightly different from blast because the computed alignment length can differ by up to 2 bp.
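In case it helps someone else, here is a minimal Python sketch of that calculation and of the > 95% filter. It assumes the output is a tab-separated file with a header row containing columns named HSP_identity and HSP_align_length; the function name, the "query" column in the example and the file layout are my own assumptions for illustration, so adapt the column names (or switch to column indexes) to match your actual plast output.

```python
import csv
import io


def filter_by_identity(handle, min_pct=95.0,
                       ident_col="HSP_identity",
                       len_col="HSP_align_length"):
    """Return (row, percent_identity) pairs with percent identity > min_pct.

    Assumes a tab-separated table with a header row; the column names
    are assumptions and may differ in a real plast output file.
    """
    kept = []
    for row in csv.DictReader(handle, delimiter="\t"):
        align_len = int(row[len_col])
        if align_len == 0:
            continue  # skip degenerate rows to avoid dividing by zero
        # HSP_identity is a count of identical positions, not a percentage
        pct = 100.0 * int(row[ident_col]) / align_len
        if pct > min_pct:
            kept.append((row, pct))
    return kept


if __name__ == "__main__":
    # Tiny made-up table, only to show the expected input shape
    sample = ("query\tHSP_identity\tHSP_align_length\n"
              "contig1\t98\t100\n"
              "contig2\t90\t100\n")
    for row, pct in filter_by_identity(io.StringIO(sample)):
        print(row["query"], round(pct, 2))
```

For a real file, open it and pass the file handle instead of the io.StringIO object.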