I want to pick out the parts of the rn4 genome beginning 500 bp before a gene and extending 2000 bp into the gene.
Is this matter as simple as choosing the option
Create one BED record per: Whole Gene
and then changing the ouput by deducting 500 from the start position and changing the end position to start position + 2000?
E.g. changing the following BED line
chr1 100000 100500 NM_019248_up_500_chr1_134302140_r 0 -
into
chr1 99500 102000 NM_019248_up_500_chr1_134302140_r 0 -
?
Edit: if the gene is shorter than 2000 bp I should not change the end position.
hi, I took the liberty of changing your example into one that is easier to read. This way, you will get more answers.
seems like your example doesn't match your words. if it extends 2Kb into the stram, shouldn't the 2nd number be 97500? and if you're using the strand information, then you'd want to use 200000 - 2000 to 2000000 + 500.
sorry brent, it was my fault, I've edited the question and made an error. It should be fixed now.