Hi,
I've data from several RNAseq experiments. I've mappend the reads onto reference and want to screen for SNPs with this. I'm able to catch them by visualising the alignment in IGV, but I cannot find them with samtools:
samtools mpileup -ugf gem/$s.fa gem/${s}.bam | bcftools view -bvcg - > gem/${s}.raw.bcf
bcftools view gem/${s}.raw.bcf | vcfutils.pl varFilter -D 10000 > gem/${s}.flt.vcf
I think it may be due to variable coverage and low depth for RNAseq. Could you recommend me some other tools that might be used with RNAseq reads?
UPDATE/SOLUTION:
I've wrote small script solving the problem. It incorporates only coverage and frequency of to call heterozygous sites. Hope someone else will benefit from this.
Regards,
I was trying your script, But got following error. I have no idea about python.
cat test_place/accepted_hits.mpileup | python heterozygosity.py -o test_place/het_test -f 0.2 -d 10
Traceback (most recent call last): File "heterozygosity.py", line 105, in <module> main( sys.argv[1:] ) File "heterozygosity.py", line 101, in main parse_mpileup( parameters["fpath"],parameters["out_base"],parameters["minDepth"],parameters["minFreq"] ) File "heterozygosity.py", line 53, in parse_mpileup contig,pos,base,cov,alg,quals=line.split('\t')#; contig,pos,base,cov,alg,quals ValueError: need more than 1 value to unpack
Could you help me ?
something is wrong with you mpileup file. each line should contain 6 columns separated by tabs. check that.
It has 6 columns; head testplace/acceptedhits.mpileup
contig_10 86 C 1 ^S. C
contig_10 87 A 2 G^SG C@
contig_10 88 C 2 .. CC
contig_10 89 C 2 .. FC
contig_10 90 A 2 .. FF
I have "><"values in 5th Column, What does it mean ? From the pileup format I can't find a explanation for this.
I know what may be the problem. The script is updated. Let me know if it works.
Thanks for your time. But unfortunately it doesn't work. This time every line is printed as an STD error and nothing inside output file.
Err: wrong line: contig10, 86, C, 1, ^S., C Err: wrong line: contig10, 87, A, 2, G^SG, C@ Err: wrong line: contig_10, 88, C, 2, .., CC