Question

Convert Fasta format to MEME-motif format

4

Entering edit mode

5.8 years ago

kashiff007 ★ 1.9k

Repeat sequences, such as 45S rDNA (gene) and many tandem repeats distributed throughout the genome. These repeats can identical in its sequence or they may vary at few nucleotide positions.

I know they can be scan through FIMO (which is program from MEME-Suite) for their location in whole genome. But FIMO required MEME-motif file which mainly contain "letter-probability matrix". MEME motif file example is shown below:

MEME version 4

ALPHABET= ACGT

strands: +

Background letter frequencies

A 0.25 C 0.25 G 0.25 T 0.25


MOTIF MA0002.1 RUNX1

letter-probability matrix: alength= 4 w= 18 nsites= 18 E= 1.1e-006
0.611111 0.000000 0.055556 0.333333
0.555556 0.000000 0.111111 0.333333
0.222222 0.166667 0.222222 0.388889
0.000000 0.111111 0.000000 0.888889
0.000000 0.055556 0.944444 0.000000
0.111111 0.000000 0.000000 0.888889
0.055556 0.000000 0.888889 0.055556
0.833333 0.111111 0.055556 0.000000
0.111111 0.388889 0.277778 0.222222
0.333333 0.055556 0.500000 0.111111
0.111111 0.222222 0.111111 0.555556
0.277778 0.222222 0.222222 0.277778
0.111111 0.055556 0.722222 0.111111
0.388889 0.166667 0.055556 0.388889
0.055556 0.000000 0.111111 0.833333
0.055556 0.777778 0.000000 0.166667
0.777778 0.000000 0.222222 0.000000
0.277778 0.611111 0.055556 0.055556

I have read the MEME motif format (http://meme-suite.org/doc/meme-format.html) in detail. I have problem in understanding the concept behind generating the "letter-probability matrix" in MEME-motif format. They also have not mentioned any software/script which directly converts fasta format to MEME-motif.

So, I have two questions regarding this:

How to convert single fasta file into this MEME-motif file?
In case I have multiple copy of fasta file of such tandem repeats which are varying at few location only, how to convert these fasta files into one MEME-motif file?

MEME-suite fasta motif FIMO • 3.6k views

ADD COMMENT • link updated 20 months ago by Ram 44k • written 5.8 years ago by kashiff007 ★ 1.9k

score 4 · Answer 1 · 2019-02-17

4

Entering edit mode

5.8 years ago

kashiff007 ★ 1.9k

Follow these steps:

Generate background files by: fasta-get-markov
Use iupac2meme to generate MEME motif file

ADD COMMENT • link 5.8 years ago by kashiff007 ★ 1.9k

score 3 · Answer 2 · 2019-02-17

3

Entering edit mode

5.8 years ago

kashiff007 ★ 1.9k

Full FIMO Pipeline Step

fasta-get-markov sequences.fasta > sequences.bg
mkdir temp
fasta-grep '.+' -dna -o < motif_sites.fasta > temp/motif_name.txt
sites2meme -bg sequences.bg temp > motif_name.meme
fimo --bgfile  sequences.bg motif_name.meme Genome_to_be_search.fasta

ADD COMMENT • link 5.8 years ago by kashiff007 ★ 1.9k