Entering edit mode
8.5 years ago
ShirleyDai
▴
50
Hello, I have some vcf files generated from GATK mutect2. I can use GATK VariantsToTable to extract start position of each variants. I wonder if there is an easy way to extract both start and end position in my vcf files. Thanks
No. I need to extract SNPs and Indels (some has >10 bases) as the following format:
Single nucleotide variants
chr4 150 150 A T
Insertions
Use ‘-’ in the reference_allele field and start/end coordinates must indicate the two adjacent bases in which the insertion occurs between.
chr4 150 151 - T
Deletions
Use ‘-’ in the observed_allele field to denote deletion of the given reference allele.
chr4 150 150 A -
I've updated the answer.
Cool! Many Thanks.