Extract RNA Support for Somatic SNV Identified using DNA
0
0
Entering edit mode
4.7 years ago
dario.garvan ▴ 520

I have a data set which has adjacent normal and tumour samples for both DNA and RNA molecules of cancer patients. Somatic variants have been called using the pair of DNA samples for each patient. I am interested in reporting only the variants which have support by RNA:

Each variant has consequence predicted by a tool such as Variant Effect Predictor. Then, support is determined by:

  • Missense mutations should be seen in the cancer RNA sample.
  • Stop-gained mutations should have reads not containing the mutation in the normal RNA sample, because the mutation could be subject to nonsense-mediated decay in the cancer RNA sample. I suppose in the absence of a matched normal RNA in a different data set (extremely rare to have adjacent normal RNA in data sets), I could look for reads not containing the mutation in the cancer RNA sample, assuming the tumour purity of the sample is substantially less than 100% and many normal cells were sequenced.

So, I am wondering about the existence of a tool that takes as input a VCF file and one or more BAM files and, for each variant in the VCF file, calculates AD and DP numerical summaries for the one or more BAM files provided. What software can I use?

SNV DNA and RNA Integration • 1.1k views
ADD COMMENT
0
Entering edit mode

If you called your DNA variants, why can't you call RNA variants and compare?

Also, why do you need a separate tool? Many variant callers allow you to specify the exact positions you are interested in.

ADD REPLY
0
Entering edit mode

Many studies suggest that a specialised analysis is required for identifying variants using RNA data. So, I would like to avoid using yet another variant calling tool and simply extract the supporting read numbers directly from the BAM files.

ADD REPLY
1
Entering edit mode

Although a specialized analysis may be required for RNA, most people use the same tools. For example, GATK Best Practices for DNA and RNA use the variant callers.

You can run samtools mpileup to generate a textual format of an alignment if you just want to check the bases at specific positions.

ADD REPLY
0
Entering edit mode

Indeed, bcftools pileup with option -a INFO/AD is producing the summaries I'm interested in seeing.

ADD REPLY

Login before adding your answer.

Traffic: 4277 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6