Hi,
I'm annotating my variants with VEP. To test, I picked a random rsID and created a test input with all possible options with strand and without strand information.
But I get both 1 and -1 strands in my output and exactly the same amount of rows. Since I don't have transcript information, I also tried --pick_allele
and --pick
options but I still mostly get -1 strand results.
How does providing strand information play a role then?
Input with strand
1 45795027 45795027 C/G +
1 45795027 45795027 C/T +
1 45795027 45795027 G/G +
1 45795027 45795027 T/T +
Input without strand
1 45795027 45795027 C/G
1 45795027 45795027 C/T
1 45795027 45795027 G/G
1 45795027 45795027 T/T
Thank you for your answer. It's interesting that it will be ignored. In documentation, it says strand is necessary.
https://m.ensembl.org/info/docs/tools/vep/vep_formats.html
Let me wildly guess: I think the strand information is necessary to make statements when it comes to annotation of variants, such as overlap with promoters/TSS/TES etc so the strand then tells the tool whether the start coordinate is actually the start of the overlapping feature on the plus strand rather than the end of a feature on the minus strand. For mutations itself it does not matter since, as pointed out, DNA is double-stranded and therefore any mutation always occurs on both strands.
Oh okay. Thank you for the explanation. My results also have problems like reporting variants even though input is homozygous reference. Especially when there is multiple alternative alleles in that position. I thought maybe strand plays a role in this but no, obviously it has a different reason. I gotta check more. Thanks!