Hi,
I have a tab delimited file with GWAS results and I am trying to annotate the variants using Annovar but my format for INDELS is different, and I get my indels split between the invalid_input, where I find these SNPs in my file:
1 63735 63735 CCTA C
1 251627 251627 AC A
1 760811 760811 CTCTT C
While they should be like this:
1 63735 CCTA 4C 0.339 rs201888535
1 251627 AC 2A 0.172 rs72502741
1 760811 CTCTT 5C 0.0417 rs200712425
and some in the "filtered" file like this:
1 36549207 36549207 A ACT
which are in Annovar like this:
1 36549207 A 0CTC 0.9076 rs143406521
1 36549207 A 1ACTC 0.9076 rs143406521
I am wondering what is the best way to convert these formats, and if there is a standard way/a script to do this as I am afraid to get it wrong and convert only insertions and not deletions or the opposite... thanks so much for your help.
Hi, thank you Igor for your reply, but I don't have a VCF file only a simple text file so I can't really use the utilities for VCF right? Maybe I can anyway? Thanks again
I didn't realize you don't have a VCF at all. Not sure how you can do this with a custom format. My first suggestion would be to create a VCF file, but I am not sure how easy it would be, especially with indels. Some options here: bed to vcf format conversion
Great! this would work, thank you!!
Sorry @igor I tried your tip for using ANNOVAR about INDELS but I am getting this error
But I don't know what does that mean