Please help me understand the following BLAST result:
Query= sp|P14738|FNBA_STAA8 Fibronectin-binding protein A OS=Staphylococcus
aureus (strain NCTC 8325) GN=fnbA PE=1 SV=1
Length=1018
> gi|49484704|ref|YP_041928.1| fibronectin-binding protein precursor
[Staphylococcus aureus subsp. aureus MRSA252]
Length=965
Score = 1404 bits (3633), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 748/1022 (73%), Positives = 826/1022 (81%), Gaps = 61/1022 (6%)
Please notice the length of the query sequence, i.e. 1018, and the length over which the % identities is reported, i.e. 1022. What I am not able to understand is where this 1022 came from. I know it may be a naive question, but please consider that I am new to this field.
Maybe the length is the target's length? E.g. your 1018-residue query is aligned to a sequence of length 1022, and 748 of the positions are identical?
But notice the length of the target mentioned, which is 965.
Can you check if there is any indel? e.g.
Yes, this seems to be the case. I aligned the query and the subject sequences, and there are 4 gaps in the query sequence in the alignment.
Okay, so the 4 gaps have been added to the query sequence and therefore the length is now 1022. But how do I know if there are actually 4 gaps? Does
QL--YK
mean 4 gaps?
No, there are two gaps in the
QL--YK
region, and the other two gaps are somewhere else in the query sequence. I copy-pasted the portions of the alignment that contain the 4 gaps in the query sequence.
Okay, I get it now. Thank you so much Siva and Sam, that was very kind of you :)
Yeah, it's there,
What does it mean?
It means there are either insertions in your subject sequence or deletions in your query sequence at those 4 positions. It could also be due to sequencing errors.
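The numbers in the BLAST report can be cross-checked with a little arithmetic. Every alignment column holds either a residue or a gap character in each row, so the alignment length equals residues plus gaps for the query and for the subject alike. The sketch below uses a hypothetical helper, split_gaps, and assumes that the "Gaps = 61" figure is the total number of gap positions across both rows and that 4 of them are in the query, as found in this thread.

```python
def split_gaps(alignment_length, total_gaps, query_gaps):
    """Derive aligned residue counts from the alignment length and gap counts.

    Assumes total_gaps counts gap positions in both rows combined.
    """
    subject_gaps = total_gaps - query_gaps
    query_residues = alignment_length - query_gaps
    subject_residues = alignment_length - subject_gaps
    return query_residues, subject_residues, subject_gaps


# Values from the report: alignment length 1022, Gaps = 61; 4 gaps in the query.
query_residues, subject_residues, subject_gaps = split_gaps(1022, 61, 4)
print(query_residues)    # 1018, the full query length
print(subject_residues)  # 965, the full subject length
print(subject_gaps)      # 57 gap positions in the subject row
```

Under these assumptions the figures are consistent: 1018 + 4 = 1022 and 965 + 57 = 1022, so the 1022 in the Identities line is the number of alignment columns, not the length of either sequence.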