(cross posted on seqanswers: http://seqanswers.com/forums/showthread.php?t=12542 )
Hi all,
I know, by capillary sequencing, that one of my samples contains a mutation at position:120458492
.
Some reads were aligned with bwa and I can clearly see my mutation using samtools tview.
120458491 120458501
CCTGCTCTGGGGAGCTATGCCAGGATGGGTGCC
........R........................
.....C..A..................C.....
........C. ......................
............ ...................
........A..... ...............
,,,,,,,,a,,,,,,,,,,,,,,,, ......
,,,,,,,,a,,,,,,,,,,,,,,,,,,, .
........A........................
........A........................
.............................C...
........A........................
.................................
........A........................
.................................
.................................
........A........................
......................C..........
........A........................
,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
........A........................
.................................
................................
...............................
.....A........................
,,,,,,,,,gg,,,,,,,,,,,t,,,,
.A........................
.........................
however, we I call the mutations using samtools mpileup
${SAMTOOLS}/samtools mpileup -uf ${HG19} sample.bam |\
${SAMTOOLS}/bcftools/bcftools view -bvcg - > snps.bcf
${SAMTOOLS}/bcftools/bcftools view snps.bcf | gzip --best > snps.vcf.gz
But I can't see the mutation in the VCF.
How can I know why mpileup (or bcftools ?) skipped this mutation ?
Thanks !
Pierre
EDIT:
...and here is the output of pileup (not mpileup):
(...)
chr1 120458492 G 26 C.AaaAA.A.A..A.A,A...A,AA^]. JddfhQacfhhgggahehhhhaB[hg
(...)
my first guess is strand bias. it appears that the alternate allele is consistently on one strand and not the other.
Nice observation. How could I tell mpileup to ignore this parameter ?
Got an answer on seqanswers: http://seqanswers.com/forums/showthread.php?p=45786#post45786. I'll try it tomorrow.
BTW did you check quality of changed nucleotides ? if low they are might not be reported.