Hi,
I'm doing a SNP identification using samtools mpileup tool. I use one reference file and 3 sample files ( all s bam files are run together so I get one vcf file ). I was able to get the .vcf file as the output. The POS
in
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT
gives me the SNP position in the reference genome. However I want get the SNP positions at each of the sample files. Does mpileup has a command in it ? If not, what can I do? ( I read about -D
, but it gives the sample read depth ). I use Python as the programming language.
Thank you for the reply. I do run with the 3 bams.
Assume,
Could you please post 1 or 2 lines from your output? (without header lines (starting with ##), if it is VCF).
If it's not enough, I can email you my whole vcc file. Please let me know.
Thanks for the record.
If I take your example in OP:
VCF would look like this after alignment:
which means
In general, Sample SNPs (bases) share the same position as Reference bases for each line. But do remember that this position is calculated only after alignment and further pruning steps. I hope I understand your question. If not, do let us know.
However let us say, you already have a single VCF with multiple samples, you would like to extract records for only one of the samples, then you can following tool and command: