Hello researchers,
I am stuck in my project and require an effective solution
- How to extract promoter sequences from rice transcriptome.gtf file?
- How to extract promoter sequences 2kb from rice transcriptome.gtf file?
- How to extract promoter sequences downstream 2kb from rice transcriptome.gtf file?
Thank you
Do you have chromosome/scaffold lengths for rice? and post lines for which you would need upstream and downstream elements. You would need each chromosome/scaffold length, genome sequence and bedtools. use functions
flank
andgetfasta
from bedtools.Basically promoter means up stream of the TSS, and TSS is the annotated start of each transcript. Hence, get start coordinates per transcript (it is the "end" coordinate of in the
-
strand), and then get 500bp upstream which is like the default for promoter approximation. Then use mentioned tools to get fasta sequences.