Question

Varscan For "Split" Regions In Rnaseq Reads

1

Entering edit mode

11.3 years ago

pulyakhina ▴ 20

Hi everyone,

I'm using Varscan to call SNPs in my mpileup files generated from RNAseq data. The signs "<" and ">" in mpileup correspond for regions that fall within pieces of a split RNA reads, and these regions are calculated in the total coverage that is in the fourth column of an mpileup file. When I look at the Varscan vcf file, I see that the total coverage per in vcf files includes those < and > (I'm not talking about low quality bases, which are nicely removed by Varscan and I can see that the coverage in vcf file is sometimes smaller than the coverage in mpileup file).

However, I think that these "split" regions should not be considered as real coverage. Does anyone know whether I can tune Varscan in such a way that it takes this into account and does not include the coverage in those "split" regions into the SNP coverage?

Thanks in advance,

Irina

varscan vcf rnaseq • 2.9k views

ADD COMMENT • link updated 11.2 years ago by dankoboldt ▴ 140 • written 11.3 years ago by pulyakhina ▴ 20

0

Entering edit mode

Why don't you want to use split reads? However, if you don't want to use them, you can just remove them from the alignment file.

ADD REPLY • link updated 5.8 years ago by Ram 44k • written 11.3 years ago by Fabio Marroni ★ 3.0k

1

Entering edit mode

I do want to use split reads. But when a read is split, the part in between the pieces of read that are actually mapped - this part is considered as a normal standard coverage in mpileup (marked with > or < sign) and I don't want this part to be considered as real coverage.

ADD REPLY • link updated 5.8 years ago by Ram 44k • written 11.3 years ago by pulyakhina ▴ 20

0

Entering edit mode

Sorry, I didn't get it. I don't know how to help with this...

ADD REPLY • link updated 5.8 years ago by Ram 44k • written 11.3 years ago by Fabio Marroni ★ 3.0k

Ram · Answer 1 · 2013-09-03

1

Entering edit mode

11.2 years ago

dankoboldt ▴ 140

Thank you for posting this question. VarScan does not count > or < bases for the purposes of variant calling. I just looked at the code to be certain.

It will report the SAMtools-reported depth, but it ignores < and > characters in the pileup.

If you have an example that seems to disagree, please send it my way!

ADD COMMENT • link updated 5.1 years ago by Ram 44k • written 11.2 years ago by dankoboldt ▴ 140