Entering edit mode
2.5 years ago
MAPK2
▴
50
Hi All,
I have a table with the list of variants and need to create a bed file for these. There are indels and some are even listed by +- bp as in the table below. How do I create a bed file for such variants?
A bed file is a white space delimited file with three required fields - CHR, START and END. You have two fields from which the three can be derived. What is your problem? I don't see any
+
, only-
s signifying an insertion of deletion like VCF pre 4.2 convention.Should I just subtract or add bases for the Pos_End? Right, they have not given + or -. I wanted to search these variants in my VCF file.
Look up some of these on dbSNP or gnomAD to understand how base positioning works - you'll need some math to calculate the End Pos, and some more math to account for the
-
Indels.For example,
2:47803501delC
(C
>-
) is in reality2:47803500AC>A
(AC>A). (https://gnomad.broadinstitute.org/variant/2-47803500-AC-A?dataset=gnomad_r3)You can of course skip this step if standardization in not something you need. You could do just the End Pos calculation using Start Pos and the number of REF bases (I'll leave calculating the exact formula to you).