Hello everyone,
is it possible to do SNV and indel calling on differentially expressed genes with RNA-seq data and without reference genome? what are the best pipelines?
thank you
Hello everyone,
is it possible to do SNV and indel calling on differentially expressed genes with RNA-seq data and without reference genome? what are the best pipelines?
thank you
Hi, I am running a similar project. First to create reference transcriptome, you should assemble the reads by de novo assembler such asTrinity and Bridger. I guess you did this step because deferentially expressed genes are available to you. Then, the second step is to create supertranscripts and annotation file, if your assembler was Trinity, the following link (GATK pipeline in trinity pipeline) will be helpful for you: https://github.com/trinityrnaseq/trinityrnaseq/wiki/Variant-Calling Otherwise (other assemblers), you can use lace software to create supertranscripts and annotation file using reference transcriptome. Then, the GATK pipeline is used to create the vcf file.
Please elaborate further on what you are aiming to do, and be more specific on what data you currently have.
If you have X number of bulk RNA-seq samples, you could technically create a consensus transcriptome via de novo assembly and then find a way to perform variant calling in this way. Be wary of the pitfalls of using RNA-seq data for variant calling, though: A: Inferring genotype based on RNA sequnces
Kevin
I want to investigate the SNVs and indels of differentially expressed genes between various genotypes of the same species of plant (18 RNA-seq samples). The genotypes are divided in 2 groups according to their differences of specific metabolites content. Unfortunatley the genome is not available yet.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
do you think that this pipeline will work with aligners other than STAR? what about CLC Genomics Workbench?
Dear sam Sorry for the delayed response. Star is a splice aware aligner and is suitable for the snp calling step. I am not sure. Perhaps other splice aware aligner, such as tophat, may also be appropriate. CLC is a comprehensive analysis package and very useful. I didn’t work with it and I don’t know the details and which aligner it uses at this step.