Variant calling with multi samples using RNASeq
1
0
Entering edit mode
8.5 years ago
Picasa ▴ 650

Hello there,

I have RNAseq data and I'm looking for some variants. I plan to call it with the GATK pipeline described here:

http://gatkforums.broadinstitute.org/gatk/discussion/3891/calling-variants-in-rnaseq

The problem is that I have 3 samples and several data (3 paired end) for each sample. Example:

1) Sample 1

  • PE 1

  • PE 2

  • PE 3

2) Sample 2

  • PE 1

  • PE 2

  • PE 3

3) Sample 3

  • PE 1

  • PE 2

  • PE 3

In order to get one vcf file at the end, Iam confused how to deal with these files. I have some ideas, but I'm not sure so your suggestions will help me :)

1) Merge all the PE together so I'ill have one PE data for each sample ?

2) Mapping with star each run and add read group information ?

3) At which point do I merge the sample ?

rnaseq variant • 1.9k views
ADD COMMENT
0
Entering edit mode
8.5 years ago
DG 7.3k

It doesn't matter too much at which point you merge the technical replicates together into sample level data. You can merge the FastQs, you can map and then merge the BAMs with read group data, etc. It might be slightly preferable to merge together the BAMs after adding read group data in case there is anything unique to the libraries that were sequenced that can help eliminate artefacts and false calls down the line.

ADD COMMENT
0
Entering edit mode

Ok so the idea is to apply this step:

2) Add read groups, sort, mark duplicates, and create index

for each run and sample (so 9 jobs in total)

and use samtool merge to produce one sam file.

FInally apply "3. Split'N'Trim and reassign mapping qualities" step with that unified sam ?

ADD REPLY
1
Entering edit mode

You can always do over with the SAM step and directly produce the BAM format and pre process the STAR aligned BAM files with GATK. You can combine the bam files merging with RG tags and then run GATK processing steps for each samples and finally perform the variant calling.

ADD REPLY

Login before adding your answer.

Traffic: 2693 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6