I have the sequence for the OP50 strain of E. coli and I would like to compare it to the reference E. coli sequence. Does anybody know how I can do this? I am trying lastz:
First of all, I don't think this dotplot is useful, as the OP50 draft is fragmented and contains 2939 short contigs.
I have no idea of why you aren't getting the rdotplot file. I just replicated your command with the OP50 draft and the K-12 substr. MG1655, and got an alignment and rdotplot file. I had first to replace the first | by _, otherwise the contig names would be mangled by lastz (even then I would get a rdotplot file):
Do you think it does not work when there is only one contig in each
file?
lastz works fine with single or multiple sequences per file, this is not the issue. The dot plot, on the other hand, is intended for alignments between single sequences. Maybe your op50 contig is corrupted or has something that bothers last?
The maf output is a simple text file, you can examine it with less, head, tail to see if it looks correct.
You may also try Mauve to align both genomes, it has a GUI and is quite easy to use.
Hi h.mon,
To clarify, I am aligning a contig that I think might be OP50 but I am not sure. My fasta file anyway only contains one sequence.
I have reprocued what you did:
Do you think it does not work when there is only one contig in each file? or maybe they are not similar enough?
C.
lastz
works fine with single or multiple sequences per file, this is not the issue. The dot plot, on the other hand, is intended for alignments between single sequences. Maybe your op50 contig is corrupted or has something that botherslast
?The maf output is a simple text file, you can examine it with
less
,head
,tail
to see if it looks correct.You may also try Mauve to align both genomes, it has a GUI and is quite easy to use.