Error in BLASR 'name in reference not unique'
0
0
Entering edit mode
7.6 years ago
biomagician ▴ 410

Dear all,

I've tried this:

blasr sx.fa.gz output/genome/ref/seq/celegans.fa.gz -out output/ -unaligned output/unaligned.txt

but got the following error:

[INFO] 2017-06-06T14:18:51 [blasr] started.
Error, reference with name "" in the reference genome is not unique

Can anyone help me, please?

Thanks.

C.

blasr pacbio mapping error alignment • 1.3k views
ADD COMMENT
1
Entering edit mode

Have you checked the fasta headers in your file? Many program ignore text after first white space so may consider the headers non-unique. Replace the spaces with an "_" and see if that helps.

ADD REPLY
0
Entering edit mode

I have checked the fasta headers in my file. There are exactly 7 fasta headers. They all start with the chromosome numbers:

"I" "II" "III" "IV" "MtDNA" "V" "X"

followed by a space and more text. For example, if I import my genome in R using genome = seqinr::read.fasta('genome.fa'), and type:

names(genome)

I get:

"I" "II" "III" "IV" "MtDNA" "V" "X"

ADD REPLY
0
Entering edit mode

Sounds like a bug. Possibly, check to make sure you don't have unnecessary newlines in the middle of the file. But also grep for ">" to see how many there are.

ADD REPLY

Login before adding your answer.

Traffic: 1705 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6