Hello,
How can I compare the novel miRNAs detected by miRDeep2 across different samples (based on their read counts)? For example, I have 3 samples(sample_1, sample_2,sample_3) from Condition-A. How can I check the read counts of say "novel_miR_1" from sample_1 with other 2 samples. I can't do this with the help of provisional id's, since they are different in every sample. Is there any other way to match these novel miR's across samples?
Thank you.
Thank you very much for your reply. Yes, I agree with you the mirDeep2 output is quite obscure and elusive for novel miRNAs. Because the provisional id's contains a random number after the chromosome number, which is different in every sample.
What I did was to remove the miRNAs having the same precursor sequence, and selecting one from the miRNAs having an overlap of more that 50% among them. In this way I was able to remove the duplicates. And then I renamed all the novel_miR across samples by considering the miRNA sequence and their co-ordinates.
But, I would consider your point of trying out other software's that predict new candidates from smallRNA-seq data. I can try this and possibly compare it with the results of novel_miR's from mirdeep2 and pick the most likely candidates.
Be carefull with removing quantification spots in your matrix. If same miRNAs have been quantified as mature or precursors or if some sequences quantified as different spots do really belong to the same transcript, the values in quantification matrix will be squeued and not representative of the real situation.
The proper thing to do is to quantify unique transcripts as a whole from the begining