Lastz.... using the xmask option
1
0
Entering edit mode
8.7 years ago
mafireyi ▴ 80

I am trying to compare two genomes by aligning them together using lastz. I have used the following command

lastz path/to/my/assembled/genome/genome.fa[multiple] path/to/other/species/genome/ref.genome.fa.gz > output.maf

this is the error I am getting is;

FAILURE: bad fasta character in /path/to/other/species/genome/ref.genome.fa.gz (ascii 1F)

I am thinking this might me a non-ACGT character error and therefore have to use xmask or nmask.

My question is how do u used xmask?

thanks

alignment • 2.0k views
ADD COMMENT
1
Entering edit mode

Hello!

As far as I have understood you need to align two genomes from UCSC.

Below you can find their detailed guide for the alignment.

http://genomewiki.ucsc.edu/index.php/Whole_genome_alignment_howto

The mistake you show, "bad fasta character" in many cases means some substitution instead of ">" sign. Check it.

In the web-site above they write about masking the following:

MASKING: Both genomes have to be repeatmasked and masked Tandem Repeat Finder (trf) first

Some people also had problems with LastZ before, like here: Error message using LASTZ in Geneious

I would suggest another tool in case you will not find a source of your problem:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4783527/

Good luck!

ADD REPLY
1
Entering edit mode
8.2 years ago
ifreecell ▴ 220

The bad fasta character error come from non-ACGT characters in your fa files, to avoid this error, you just pass "--ambiguous=iupac" option to lastz, for details see lastz documentation.

lastz genome1.fa genome2.fa --ambiguous=iupac --format=maf > output.maf

ADD COMMENT

Login before adding your answer.

Traffic: 1808 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6