Hi everyone!
I was trying to build library with RepeatModeler from fasta seqences (containing only transposons) of some species of fish. Let's say that I've done it, but I'm not sure if it's correct. My prepared fasta file contains 1072 sequences of fish, but the library created by RepeatModeler (file consensi.fa.classified) contains ONLY 8 sequences and I wonder if it isn't too less. Below I add exact command lines to show You what I've done, cause mabye I did something wrong through the process of creating the library. In addition to this, RepeatModeler didn't create .xdf file which I've expected after reading manual. Here's the link to example run: http://www.repeatmasker.org/RepeatModeler.html
Command lines: (Based on http://www.repeatmasker.org/RepeatModeler.html)
- BuildDatabase -name Database -engine ncbi Mobile-elements.fasta
- nohup RepeatModeler -engine ncbi -database Database >& run.out &
From manual:
-database The prefix name of a XDF formatted sequence database containing the genomic sequence to use when building repeat models. The database may be created with the WUBlast "xdformat" utility or with the RepeatModeler wrapper script "BuildXDFDatabase".
However there is no script like "BuildXDFDatabase" and WUBlast was replaced by AB-BLAST, which you have to pay for license.
I hope that someone can answer me, cause I doubt that this library is correct and i I can't go on with my project. Thanks in advance :)