Entering edit mode
8.2 years ago
agata88
▴
870
Hi again,
How to divide vcf file into two files or call two consensus sequences from vcf file for hetero SNPs?
Best,
Agata
How do you plan to phase your variants per allele?
For example I have two snps in my vcf, one is hom TT and other is het GA. I would like to get two consensus sequences where one is with TG and second TA. Is there a tool that can do something like this?
Per @WouterDeCoster's question, what if the first is het AT and the second is het GA - how would you want to report it?
hom TT, het AT and another het GA I would like to report as: TAG and TTA assuming that TAG is from one read and TTA from another.
So the tools should also look into the bam file to find out which reads are supporting which variant calls? That's making the story more difficult, as you can imagine. What if there is no evidence to derive phase from the bam (no reads spanning) between position 1 (GA) and position 2 (TC)? How to report that?
I would discard those reads from analysis ...
This is getting weird. What do you really want to do, I mean, what's the goal of the analysis? Why do you want to split a vcf file without biological meaning?
I would like to perform HLA typing ... and wanted to follow this article:
https://bmcgenomics.biomedcentral.com/articles/10.1186/1471-2164-14-355
I might be wrong, I am getting confused here. They are using two "original own perl scripts". I am trying to write that. I have amplicons for whole genes not only for HLA but also KIR. I tested few pipelines which are "ready to use" and I obtained different results :/ So, I am trying to figure out the best solution.
Right, so you performed long range PCR of your targets of interest, followed by NGS library prep. It would make things much more clear if you would have stated that in your original post. Have you tried asking the authors for the perl script? In my opinion it is unacceptable not to publish such an important part of their work.
If this project is something you want to continue, you might want to consider getting a MinION and sequence the longe range PCR products directly without shearing.