How to extract promoter sequences from a plant draft genome?
0
0
Entering edit mode
5.2 years ago
Kumar ▴ 120

I would like to extract promoter sequences from a draft plant genome. I found many web tools for the same, but those are accepting small file as a input sequence (maximum 5 mb). I have 350 MB sized plant draft genome, therefore please suggest me any off-line tool for the prediction of promoter sequences.

Thanks in advance.

promoter genome • 2.2k views
ADD COMMENT
2
Entering edit mode

I faced the same problem around year ago working with Nicotiana tabacum genome (annotated scaffolds). At that time I did not find the solution so just tried to write my own bash script that utilizes the common software like samtools and bed tools and outputs multi fasta with promoter sequences. Here is it: https://github.com/RimGubaev/extract_promoters Hope this could help!

ADD REPLY
2
Entering edit mode

Looking at that code, it is mainly extracting flanking sites from annotated genes. While this is broadly addressing the goal, this should not be confused with finding bona fide promoter sequences, merely the regions where they would actually be found.

I say this just in case someone in future comes to this thread and runs that code as a 'black box'.

ADD REPLY
0
Entering edit mode

Yeah, that's correct, thanks!

ADD REPLY
0
Entering edit mode

Thank you @rimgubaev

ADD REPLY
0
Entering edit mode

Many thanks, dear colleagues!

ADD REPLY
0
Entering edit mode

Do you just want 1kb upstream of genes? That's one line of bedtools.

ADD REPLY
0
Entering edit mode

yes indeed @Asaf, could you please elaborate it.

ADD REPLY
0
Entering edit mode

Take a look at bedtools flank

ADD REPLY

Login before adding your answer.

Traffic: 1403 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6