I am using this pipeline to make a vcf file. but m having problem at step 4. bcftools view -bvcg my-raw.bcf > my-var.bcf
it says -b is not a valid parameter
.??? what to do ?
Convert SAM to BAM for sorting
samtools view -S -b my.sam > my.bam
Sort BAM for SNP calling
samtools sort my.bam my-sorted
Run 'mpileup' to generate VCF format
samtools mpileup -g -f my.fasta my-sorted-1.bam my-sorted-2.bam my-sorted-n.bam > myraw.bcf
Call SNPs...
bcftools view my.var.bcf | vcfutils.pl varFilter - > my.var-final.vcf
Step 1: check if you have a recent bcftools version
Step 2: if it is the recent version, check if the parameter used are still valid by typing
bcftools view --help
It looks like hafiz.talhamalik is following an outdated tutorial in variant calling using samtools/bcftools. In the recent versions this looks like this:
fin swimmer
yes my tutorial is old but the command you gave is giving an empty vcf file.
Hello again,
what do you mean by "empty". Just the vcf header or really a blank file?
fin swimmer
completely blank one. no header, nothing
Have you tried running them separately??
set the
--ploidy
if required in the variant calling step.yes i tried them separately but still empty file.
Hi, have you checked the sorted bam file? The command you mentioned for sorting is not correct. You can either redirect the ouput using
>
or can use the-o
option. Also index the refernece fasta file with samtools in the same directory.yeah I tried that. sorted bam file is fine.
Could you please post the output of these commands:
$ samtools --version
$ bcftools --version
$ samtools view -H my-sorted.bam
$ samtools flagstat my-sorted.bam
$ cat ref.fasta.fai
fin swimmer
This looks like you're trying to align a fasta sequence and not fastq. AFAIK
bwa
can do this. But I'm not sure ifbcftools
can call variants from this file, because the base quality values are missing and ifsequence.fasta
contains only one sequence you only have one read per position in the alignment file.So could you please tell us more about what you are trying to do?
fin swimmer
I have downloaded all the fasta sequence of salmonella from ncbi and now i want to compare snp's from those fasta files with my sequenced data's vcf file.
So you have downloaded assembled genomes rather than separate reads?
You can do variant calling from those but it would have been absolutely important if you mention things like this in your first post.
yes assembled genomes.
You have wasted a lot of your and our time by not telling us that earlier. I have added an answer to this thread.
oho. M really sorry for that..!
argh, it should be
samtools flagstat
(without s). I've edited it.