A fasta file with precursor sequences, (IS this the hairpin file on mirBase?
According to this source, page 6:
- hairpin.fa provides the precursors, and
- mature.fa the mir sequences
So, as put it hairpin = precursor sequences.
a fasta file with star sequences - Where do I get this?
In my pipeline, Mapper + MirDeep2, I did not use star sequences.
If you are using iGenomes, the Mature and Hairpin sequences are in Annotation/SmallRNA
(at least for zebrafish). These sequences did cause an error to be thrown at me by mirdeep:
Error: miRNA reference this species file genomes/Danio_rerio/Ensembl/Zv9/Annotation/SmallRNA/mature.fa has not allowed whitespaces in its first identifier
I removed the whitespace from the headers with sed
:
cat mature.fa | sed 's, ,_,g' > mature.noSpaces.fa ;
cat hairpin.fa | sed 's, ,_,g' > hairpin.noSpaces.fa
I did not filter for rRNA.
Great! Thanks so much!
Did u use RFAM?
No. It was very "quick and dirty" type of analysis, and I did not filter for any repeat elements. Can't help you there sorry.
Thanks so much though! One last question I promise- I have set of normal and disease samples, can I process them using the config file at the same time. Will I get read counts for each sample as result and comparable across? Also, could you just share the results files (maybe just a few lines) with me please? I am using collaborators computer resources so do not ahve too much time to test and try.
Thanks!!!!