Mirdeep2 Error: FASTA reads file is not in accordance with the fasta format specifications
2
0
Entering edit mode
9.0 years ago

Hi All,

I get this error from while running mirdeep2. I have tried two data sets and the same error results. It says:

First line of FASTA reads file is not in accordance with the fasta format specifications
Please make sure your file is in accordance with the fasta format specifications and does not contain whitespace in IDs or sequences

***** Please check if the option you used (options c) designates the correct format of the supplied reads file earthwormShort1.fa *****

I ran a perl code to delete whitespaces. I also tried this same data sets on other tools like sRNAbench and I had no error. Do I need to tweak the fasta file in a particular way for mirdeep2 to work with it?

Thanks!

RNA-Seq software-error • 5.5k views
ADD COMMENT
0
Entering edit mode

You should post the first line of the FASTA file.

ADD REPLY
0
Entering edit mode

Here is a few line from the fasta file. I believe Mirdeep wants it formatted in a some kind of format...

>cel1_count=3
TGCCTTGTCTGTCCTAAAAATC
>cel2_count=9
GTTAAGTGGGAAACGATGT
>cel3_count=7
CCGACCTTGAAATACCAC
>cel4_count=7
TAGAAATCCACTATGCTTTGG
>cel5_count=5
CGCGGGTGAGCAGCCTGGTAGCTCGTC
>cel6_count=3
TCCTGTTTTGTAATCGGCTGCA
>cel7_count=4
TACCACGTCCAAGGAAGGC
>cel8_count=3
GGCCGCGTGGCCTAATGGATAAGG

Kindly note that the first header line is not indented as shown. I cant tell why this note editor indents it after I hit submit.

ADD REPLY
1
Entering edit mode
4.8 years ago

$ dos2unix reads.fa

sovled my problem!

ADD COMMENT
0
Entering edit mode

This solved my problem too. the problem is the trailing ^M at each line of the file (you can see it using cat -v file.fa | head). The ^M is a carriage-return character which originates from windows. dos2unix replaces it with a single newline. it can be installed using apt-get install dos2unix

ADD REPLY
0
Entering edit mode
9.0 years ago

You should look at the file format description in the documentation. It looks like the mapper.pl script expects the FASTA identifiers to contain two underscores which delimit three fields such as:

>PAN_123456_x969696
ATACAATCTACTGTCTTTCCT
ADD COMMENT
1
Entering edit mode

Thanks for your response, but then, how can we change our .fa files to fit the mirdeep2 description?

ADD REPLY

Login before adding your answer.

Traffic: 1890 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6