Is There A Samtools/Bcftools Setting To Call Variants No Matter How Infrequent?
2
0
Entering edit mode
13.1 years ago

I can't get this position to be called by the samtools mpileup/bcftools stack.

I get the variant if I ask for all positions (not just variant):

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -

(showing one position for brevity)

myRef    21    .    C    T,G,X    0    .    DP=3796;I16=3730,0,55,0,135424,4994460,1972,72624,42241,1163355,634,7544,26664,192714,488,4340    PL    0,255,199,255,217,199,255,217,199,199

but if I ask for only variant positions I don't get anything.

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -v -

It is filtering but I can't figure out which filters to turn off.

The pileup itself is here:

myRef    21    C    3796    ...................................................................................................................TT........T.........T.....TT..T...T.T.T.....TT.TT...T.....T.TT..........T..TT......TT.T....T...TT.TT......T..T..T.T..TT...TT....TT.T..T..TT.TTT..........TT......TT............................................................................T...............................................................................................................................................................................T...........................................................................T................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................G.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................T..............................................................................................................................................................................................................................................................................................................................................................................................................................    HG7GHHDFGFHHHGHHBH8FHGHHHHFDHHHCGEHEH6>HHHH4HFHHDHAHHHHFGCHH<G@GHCHH:FD?EHHEGHH7EHHHGEE?;1HGGHB14EDHHFEE:HH?HH<GHBFEHHHGH>HEHHGHHGH@EHGGB>HFH<GEH>HHGHFHFEHGHBGHHHHHHGHHEHHGHGBGHF@HFHHHHGEH4HDGHHEFDEHHEF@CBG/HHHHHHGHDHHH?HGDFHHHHF@3HGHHHHHHHE=:HEHHHHHHE@DHHHHHHHGHDHEEHGHFEBFFHGDGGFD<GEHHBHF=HGHHGGHHH/FHHEHEFH;HDFHEHHG?BEHEHHHDHHGHHEH;EHGEHDFD?DBHFHGGEHHC:HFHHGGHHGHBHGEHFHFHEEHBHHHHHDHHH6HHEHH8EH>HHHFBGHHGEHGGE?H0HF<HGGHFHFBHHH>HHBBHBHHHEHHGGB3:HHE7@=@EGEHHHHHHFHHE>HHHCHHEGEHHGHHBHEDHHDEEHDGHEG@GCH;HFHH>5EC>FHHGHGHCEHCHBHBCH=B<GHHDHEHHDEHFGHHFEHHFHHHEEH?GGHHH3HEEC?HGF:GH7EHCHHHEGHE4HH??>EEBHCDGBHHDHDG4?EG9HGGG@EEEHB1EHHHHGGDHHHCE6GGHHHFGHHFHGGHHCHHFHFH/4GGGEHFGHHGHH9EHHHHEGFGHGEE?G>EGEHHH@BEE?HGEGGHHGHDFDEHHHHFEHHEHHHHGHHHFGHHHFCHHHHGHHFHFH2HHFBHGHHHHGHBHHFG?HHH;H4EHCHHHHHBGHEHE=GGHHGCHHF2HHHFHBHHHHHGHHD2H<HH>HFHEHG9HHEHEHHGGHHCHEGG3HHHEHGHGEHGFBHGHG@GH>DHHH1HHHHHHE@HHHH0EBHBDE=FHH?GHHEHHHHHHBHHDFH>FG@EHFEHEAFHHHGHHEGHH<GGHHHH@H;HHHGH6HHCEHBEHGH>E;H4GHHBHDHH<F9FHEHHHHHHDE7GHHCHEHEEGHEHHHCHHHD;8E@G?HH9HBHBH>C@H?HHE@HHEHBGHEFEHEBFDHHEHH9HHH>GFHHHH;EHHBHHHHF6EGHHH@HEEGHBHFB>EHHH@HBHBHHHHHHEEHHFE?HHDEHGHEHHGH=HGG@GHGEEH7F:HFH:EHH@HEHGHECGBBGEBEF>FEGHHHEDHHEH/BHHHHEHGDC,<BBHCHFBHDGG2?DHHH?GF9GBGHGH<HHHHHHGGHHHEB@EGHCG=EGEHGG;CBBECH7EEGGAHFGEHHH=?EGH9HHHHFGGHH8HGHE83H>H>EGHFGHGGHHEGHGDHHHHGHGHH=EFGEHHH7?@HHHF2HEGGHDHHFFHHCHHHHFHHHH.DH;?CHHHHHHFHDHHH2H6HFFGB9EGEHEHHHEBHHH@H6H=HEDBHDHHHHE:HHHGCHEGHHEEEHHGHGHHHHDHBEHHGHEHHDGEHHFHHHDBHHHBHHHHFHHHH?HHCH?HH25/FHD4HHHDDEEHEEHHDHHEEDEHHH?DH2HHHHEBBHHBH@HHHDH@HHEFHHGGGHHHBF>F?HGHEGHHGHGHGDHHHEFHHHHH@HHDDGHG8C;GGEEH8H@HDHH<HHEHHH4EGHHBHBHGGFH@DHFH=EEHHHHHH?HGHHHHHHHGEFF,GG?H6GDBH@HHFHHHHHHHEHHGDGHBE<GGHHHEHHEF4DFGHHEHGH0HFH=GGGEH/HGDGHHBHEHHHHDEHCGHHEH?HFD9@FH3BHHFEEFGH>HHHA6GEH@2B/<EGEHHHH8HHFC/GGDFHEHHEBE=8HFEHBHG=HGHHBHEH?EHHHGHEHHFGHEHHE%H<HHHBH8HEEHE<DGG?@EGEHGHBFHHGBEEH@E<GH?HHGFDFH9B6GEBHHFHGFHHHHHHGGHFGHH@=GHFHHECEHFFAHHEFHE*G/?HFHBGF?G7G:GDHDHHGGDHHE@GHHG@HH?H>HF/FEEH3EFHHC;HHH?;GHHFEEF4EHH<;E=H9EHFCHGGHE>GEEDHHHHEFHHHGHHD?GHHEBB8EEBHBH>HH;FDCGHGGCC?HHGHHHEFHHHEHDHE?@HFE2HHDHHHEHFG:CDF:6>E>GHFFH9EDHH4EHHF6HD7HHHGFGDH4GDHEHHHEG6HGHH*HHHHEFGH.HHHFHHHBCCHB6HG=HHHHHGHHGEDHH6FGH>FDFHHFHHHHFEHHGHGHDHG?7HDHH;4@HEEHHHGGH:CEHFHHHHGHH:H2<HFGEDEEHHEFHEH?HEGHHH>BHCHBHHHHFHHFEH0HHHH=BC6HHHGHGGBGHEHHHH:EAHFHEHHHHHEHEHHHHEHHGHFGHHHEHHHHH?HHEH7HH4?HHEHEHHEFFED=GHGD;HHBHHHGHHFHHEHHGCHBFGHHHHEHGHEHGHE7HEHGFHHHF8HAGH=HEFGHEDHHHCHHHHHHEHHHDHEHHEDHECHCED:HEH:HHHHGGHH;;FEEHEEEHEHG@DHHDH5HFFFHHEDHHEEFHH=FHEHBHHHHHDHHHEGFDHGFHHHHHHFCHEHHH?G:EFGEHCGHGHHGHHHGE?GH*H>HHGHHHEFGHHEGGHHHHHH>GGH@HH=HHEHHBEGEHHDHHGHHHEHH9EDHHEGHHHHGHHGHHFECHH:HFGHHE9DE6HFHHH;EHE>HEEFEHBHHHHEHEFEEHHHHHG0@HC=HHBHBEGHHGH:BFEHHHFHH:HBGHDGHGEEHGHEHGGE6HBHHCEH?EB7GHEHFHH?GBG=@EEEBCE?HBHH?HEFHHHGHHHH6EHGHGGGGHHBEEHHCDHHHE=HHH:.HFHGGHEHHHGFGFHBHGHD@HHGHFBHHBHFHEGHHGHEHH9H3HHHHHBHFCHFHGE2GHHHHH<HHEHHCDH>EHHHBHHH>FECH0?EDHDE@HFGH<HHHFEFEHEFAHH@HEHGHHHH@BGEHHHHEHHEGHHHHHH?HH4FHHHEHHHHDHHHHE?FHHH?E:GHHHHHFHGHHHHEHHHHHHEG,EHBEHEHHHGDHBH8GDHH2HHGHHHHGHHBBEH;6HDHHH3H:HHH2HBHDHHHFHEHGHGFHHHGHGHH9HDHHE9HHE@HHHHHHHGHHHHAH=HH9HGECHBGBHEEHHHH@BHHEHHHGHHHHHHDHFHG@E=HD77DEHEH0HEDGEDEDHGHHHBHHHB;EHE)HGEEEFE:HHHEH7HEHHGHHHBHHGHGFHHGEHEHHEH@HEEHHH@?H@DHHHHFEEGC>GH9DH5H=H:HHHGG?H?EHHG;GEEHHHDGHHHHHHE@FCDHHEHHF9HHCCHGHEFHH=HH=G;HE=>HGHH<HHHHGDEF7G-GEH:GEHEFHHHHH>G4HGHHEGHHHFEF;HH@HHHHECHHH<HHHEHFE<GDFHHE'HHE=GHHH@9H@CHHHFHHHH>GH=HHHH?HH4HHE?GHHHH>HGDACHG2GEHHEEDGEHDHH5HH8HEHHHH>BHHBEH?DHBHH7EH1EHFHG:HHEGH<CHHGHH:GHH<=HGHGHH>EHHEGHGE=GFFHHHGHBDHHGGHH:BHHHEBHHHHEHHHG?HGFHHBHHHG>HFEEGGEHHHEGEFHHEHHGG6HGHGHHHHHDHHEGHAGFE1DEH:HHHDGH2G8CEFHHHHHGHGHHHBHHGAFHHGHH?HFHGGHHGHH?HH=HGHHEH;EFFGHDGHEEHHHHGHGHHHFGHE<HH?GHBHGHH>HHGHGFHFGHGHG<HHHHHH@HH,C@HGHABHHHHHHHGHHGEHGGHDHGBCDBGGGHHHHHHHGFHGHHGDGHBHGHHHHGHHEHDHFHHHHGHFHHHDBHGBHH?GHHHHHHGHHGG7HGHHHHGG=GC5HHHH?H:5?GHEHGHFGHFG
samtools bcftools • 6.5k views
ADD COMMENT
1
Entering edit mode
13.1 years ago
Doctoroots ▴ 800

Hi Jeremy, this could be a possible duplicate of this question, where the answer is that the use of BAQ by samtools reduces some true positives together with the false positive reduction and it is suggested to user either -B or -E in order to fix it.

also, your problem could be due to high coverage filtration, in which case you should use the -D / -d options to set the maximal accepted depth.

ADD COMMENT
1
Entering edit mode
13.1 years ago
lh3 33k

Bcftools assumes your sample is diploid, which greatly improves the power for variant calling from diploid sequences. Your SNP has <2% frequency, even approaching the sequencing error rate. There is no way bcftools can call it. If you really want to call it as a SNP "no matter how infrequent", you should just parse the I16 field (first 4 numbers are identical to DP4).

ADD COMMENT

Login before adding your answer.

Traffic: 2683 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6