SNP calling from pooled RNA-seq data
10.1 years ago
thjnant ▴ 160

Hello,

I want to call SNPs from pooled RNA-seq data that have been mapped to a reference genome. My organism is a bird. Does anyone have experience with this, or a recommendation on which software to use (for example GATK, samtools, or FreeBayes)?

RNA-Seq • 3.7k views

At least with GATK you can set the -ploidy option (I've not tried this, but it allegedly works). You'll need a good amount of sequencing depth per pooled sample to reliably call variants in anything but very highly expressed genes, though.
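To put a rough number on the depth point, here is a minimal sketch of how likely a rare allele is to show up at all at a given coverage. It assumes a simple binomial model: every individual contributes equally to the pool, both alleles are expressed equally, and sequencing error is ignored. None of that holds exactly for RNA-seq, so treat the output as optimistic. The pool size, depths, and the `prob_detect` helper are made up for illustration.

```python
from math import comb


def prob_detect(depth, alt_freq, min_alt_reads=2):
    """Probability of seeing at least `min_alt_reads` alternate-allele reads
    at a site covered by `depth` reads, under a plain binomial model."""
    # Sum the probabilities of seeing fewer than min_alt_reads alternate reads.
    p_fewer = sum(
        comb(depth, k) * alt_freq**k * (1 - alt_freq) ** (depth - k)
        for k in range(min_alt_reads)
    )
    return 1.0 - p_fewer


if __name__ == "__main__":
    pool_size = 10                 # hypothetical: 10 diploid birds per pool
    ploidy = 2 * pool_size         # chromosomes represented in the pool
    singleton_freq = 1 / ploidy    # allele carried on one chromosome only

    for depth in (20, 50, 100, 200):
        p = prob_detect(depth, singleton_freq)
        print(f"depth {depth:>3}: P(>=2 alt reads) = {p:.3f}")
```

Since coverage in RNA-seq follows expression level, the depth you plug in here is per gene, which is why lowly expressed genes are the hard case.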

10.1 years ago

There are a few papers that talk about reliable identification of variants from RNA-seq data.

10.1 years ago

We have developed a method called ESNV-Detect to do this task from human transcriptome data. (ESNV = expressed SNV).

Manuscript is in press; you can download the software here.

An abstract describing initial results from the AACR 2013 meeting is here.

Update:

The manuscript is now published in NAR. Abstract here and full article here.

ESNV-Detect Workflow: Call expressed SNVs from RNA-Seq data

10.1 years ago

Hi, I am working on some pooled data as well. I think GATK does support pooled sequencing, as long as you set the ploidy; check their documentation. I used VarScan, and I like it. It generates standard VCF output, which is compatible with lots of downstream programs.
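For the ploidy setting, the usual convention for a pool is number of individuals times the organism's ploidy. Below is a small sketch that works this out and assembles a command line, assuming GATK4's HaplotypeCaller and its `--sample-ploidy` argument (older GATK releases spelled things differently, so check the documentation for your version). The file names and helper functions are hypothetical, and the RNA-seq specific preprocessing steps are left out.

```python
def pool_ploidy(n_individuals, organism_ploidy=2):
    """Chromosome copies represented in one pooled sample."""
    return n_individuals * organism_ploidy


def haplotypecaller_args(reference, bam, out_vcf, n_individuals):
    """Build an argument list for GATK4 HaplotypeCaller on one pooled sample.

    Assumes GATK4 argument names; adjust for the version you have installed.
    """
    return [
        "gatk", "HaplotypeCaller",
        "-R", reference,
        "-I", bam,
        "-O", out_vcf,
        "--sample-ploidy", str(pool_ploidy(n_individuals)),
    ]


if __name__ == "__main__":
    # Hypothetical inputs: one pool of 10 diploid birds.
    args = haplotypecaller_args("bird_ref.fasta", "pool1.bam", "pool1.vcf.gz", 10)
    print(" ".join(args))
```

The list can be passed to `subprocess.run(args, check=True)` if you want to launch it from Python rather than paste it into a shell.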

Let me know if you have any other questions.


Thank you everyone for your very useful comments. I am going to start the SNP calling in the next few days and will let you know about my questions and experience.
