targeted resequencing data analysis
1
0
Entering edit mode
5.6 years ago
dentepre ▴ 20

Hi,

I have to perform variant calling from DNA sequencing data, but in targeted sequencing mode. My libraries were enriched by set of about 100 genes and I want to find mutations (SNV and indels among them).

What is the best way to perform this kind of analysis? Should I mapped to the whole reference human genome (like for whole exome sequencing pipelines) and then select only interested regions or just create an artificial "short" genome with only interested set of genes and then mapped to this "artificial genome"? Or completely different?

And could you recommend me some papers about analyzing data from targeted sequencing?

Best, d.

next-gen • 1.2k views
ADD COMMENT
1
Entering edit mode
ADD REPLY
1
Entering edit mode
5.6 years ago

Should I mapped to the whole reference human genome (like for whole exome sequencing pipelines) and then select only interested regions

Yes. Always align to the full genome. During variant calling you can specify which regions you are interested in, for example using the -L argument of GATK.

And could you recommend me some papers about analyzing data from targeted sequencing?

If you know about analysis from WES then this is the same, just smaller.

ADD COMMENT
0
Entering edit mode

Hi,

I would like to know that If I am interested in more than one chromosome, then Should I use ? -L chr 1, chr 25, chr 26

ADD REPLY

Login before adding your answer.

Traffic: 2796 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6