Entering edit mode
5.1 years ago
ravi.eshwari
▴
10
HI, I need to download 2000 upstream promoter region sequences of few genes but when i specify the gene along with its coordinates and chromosome the output sequence file i get contains all other coordinates of those genes except the one i mentioned in the input how do i download exact coordinate sequence file? please let me know
thankyou!!
You can simply obtain the coordinates of the entire gene and then use
bedtools slop
to extend the start of the gene by a desired number of basepairs in upstream direction to get an approximation of the promoter sequence. Be sure to use it in a strand-specific fashion to respect the strand the gene is on.