Question

miRDeep2: expression reads and input reads not tallying

0

Entering edit mode

10.1 years ago

SJ Basu ▴ 60

Hello,

I am working with miRNA for quite some time now and use miRDeep2 for it. I have these two queries to which i've searched a lot for the answer but without any success.

The read depths for miRBase hits (both result.csv & expression.csv) doesn't equal to total reads of the input file. I suppose it should be like Total reads= miRBase hits + novel miRNAs isn't it? Or am I going wrong
what is the last part of result.csv signifies?

My pipeline: raw reads-> cutadapt -> rRNA filtering -> mapping to genome -> miRDeep2

Any help in this regard will be highly appreciated. Thanks in advance!

miRDeep2 miRNA • 4.8k views

ADD COMMENT • link updated 2.8 years ago by Ram 44k • written 10.1 years ago by SJ Basu ▴ 60

0

Entering edit mode

What happens when you add in the unmapped reads?

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.1 years ago by Devon Ryan 104k

0

Entering edit mode

Sir,

I am not exactly sure which unmapped reads you are pointing to cause I am performing mapping twice (with rRNA then genome). I do create the .arf file and reads_collapsed.fa from the sam file (which has both mapped and unmapped reads) of reads mapped to genome using bwa_sam_converter.pl (provided by miRDeep2). Now this arf automatically takes in only the mapped reads from the sam file.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by SJ Basu ▴ 60

0

Entering edit mode

Hi,

I don't know exactly the reason, but I guess it should be Total = miRBase hits + novel miRNA + everything else. You are guessing that everything not in miRBase is novel miRNA, something that is not true. There are a bunch of sequences that are not miRNAs, like tRNA, or fragments of genes, or repeat regions.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by Lorena Pantano ▴ 380

0

Entering edit mode

Thanks a lot for affirming my belief :) Lorena I knew about this fact but what's I am upset about is I am providing 7.1 million reads of which 2.3 million reads is in 17-22bp region but miRDeep2 is showing only 194346 reads(miRBase + novel). Don't you think its too less? Actually me and a senior of mine we were running to different programs and confirming the sure miRBase hits (he is running miRNAkey). Now is the most disturbing part when I ran miRNAkey on the same data, he got 138 significant miRNA and I got 78 significant miRNA! I am clueless about where the mistake is happening

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by SJ Basu ▴ 60

0

Entering edit mode

whoo...that's a lot. That only happens when the species is wrong. About the number of DE miRNAs, don't know if you are using the same tool to that or not. But if not, that is normal.

I don't know if you are saying that you both got the same number of hits, or you got much less that your colleague?? if so, it seems you have something else there, maybe other species miRNA. If not, something happened with miRBase.

I will happy to run one sample with another tool if you only tell me the species, just out of curiosity.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by Lorena Pantano ▴ 380

0

Entering edit mode

okay so miRDeep2 and miRNAkey outputs may not be same - I got that. But the problem is when my colleague is running miRNAkey its giving 100000+ reads in 138 miRBase hits (significant ones that is p value < 0.05) and when I am running miRNAkey on the same data in my computer its giving 8000+ reads in 78 miRBase hits (significant ones that is p value < 0.05)! I updated miRBase through update option in the miRNAkey GUI. I got this version of miRBase http://mirbase.org/ftp.shtml and grep ">osa" to separate out my species then indexed it. Oh yes my species is Oryza sativa.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by SJ Basu ▴ 60

0

Entering edit mode

Well, if you are using exactly the same, and get different results....that's weird. Some reason could be tools version, and parameters. Or your input files for some reason are different. It can not be more than that....this is weird.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by Lorena Pantano ▴ 380

0

Entering edit mode

I did check my parameters and tool version, they are fine and matching. I am kind of in a fix with this problem! Though, me and my senior we were discussing the fact that may be our filtration step is giving different number of reads to the input file. Meanwhile I tried miRPlant and output looks pretty satisfying. Can you provide me with some opinion on this tool?

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by SJ Basu ▴ 60

0

Entering edit mode

I read about that, not my expertise because I never did plant miRNA, but it seems good. If you are getting better results, just inspect them, If they make sense, all good. Sorry to not help a lot in this case. It seems that the parameters to play most are the one related to the hairpin size, or min loop length, but I think your species is not so weird, so I will play not much, maybe some different values to see difference in results.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by Lorena Pantano ▴ 380

0

Entering edit mode

ok. I shall do so and lets see how else can I solve my original problem though you were really helpful. And thank you so very much for all time you took out for this post. :)

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 10.0 years ago by SJ Basu ▴ 60

0

Entering edit mode

Hello

Recently I am analyzing sRNA data..and I saw your post because I am also exploring that which tool is better for this analysis. If I am not wrong then 194346 reads from collapse file that's why it is showing less number.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 9.6 years ago by DL ▴ 50

0

Entering edit mode

Hello,

No Deepika actually I did the calculation from collapsed file too. The thing is they enlist only mapped miRNAs, and mirdeep2 is good for animals but kind of falls short with plants so, I went with miRPlant and it was pretty satisfying. If you are analyzing plants try that or else miRDeep2 is good.

ADD REPLY • link updated 2.8 years ago by Ram 44k • written 9.6 years ago by SJ Basu ▴ 60