DNA Motif discovery from a large number of FASTA sequences
2
0
Entering edit mode
6.7 years ago
flyguymail22 ▴ 10

Hello

I wish to find the recurring/enriched motif(s) in a large number (approx. 7000) of DNA FASTA sequences. I have given MEME a try but it does not scale as intended.

Any thoughts/suggestions/recommendations for another software or strategy are highly appreciated.

Thanks. FG

sequence motif alignment • 3.1k views
ADD COMMENT
1
Entering edit mode

Hello, did you attempt to use MEME via command line or the web version of it? In the second case, I would just recommend switching to command line.

ADD REPLY
0
Entering edit mode

Hi, I used the command line version.

ADD REPLY
0
Entering edit mode

I see, then I think you should be just fine with the MEME suite, what is your problem exactly?

ADD REPLY
0
Entering edit mode

Hi, sorry for bothering you but do you want novel DNA motifs for example from ChIP-Seq studies?

ADD REPLY
0
Entering edit mode

Hi. Yes. I am looking at discovering novel motifs from a RNA+Chip-Seq study.

ADD REPLY
2
Entering edit mode
6.7 years ago
EagleEye 7.6k

Try Homer, http://homer.ucsd.edu/homer/motif/

ADD COMMENT
0
Entering edit mode

Thanks EagleEye for the suggestion. However, with Homer, what would you recommend for a 'background' FASTA file.

ADD REPLY
1
Entering edit mode

2. Background Selection findMotifs.pl/findMotifsGenome.pl

If the background sequences were not explicitly defined, HOMER will automatically select them for you. If you are using genomic positions, sequences will be randomly selected from the genome, matched for GC% content (to make GC normalization easier in the next step). If you are using promoter based analysis, all promoters (except those chosen for analysis) will be used as background. Custom backgrounds can be specified with "-bg <file>".

ADD REPLY
0
Entering edit mode
6.7 years ago
c.chakraborty ▴ 180

Hey if you are looking for motif enrichment use centrimo or MEME-ChIP, because there you can upload you sequences from DEG and unchanged genes and look for Transcription factor binding sites, for example or for miR datasets. If you are using MEME or any othe above mentioned tools, you can use the out put for further categorical analysis, as to which genes possess these motifs and their recorded possible function by using GoMO

ADD COMMENT

Login before adding your answer.

Traffic: 2481 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6