Hi
I have an RNA-seq data, using that
1) i have mapped the reads to the reference genome
2) removed the reads mapped to rRNA region
3) then again mapped the remaining reads to the reference genome, after this if i check the rRNA region there are still some more reads mapped to it
For example, Before removing rRNA reads, this reads is aligned as shown below
SRR1033691.788517 141 * 0 0 * * 0 0 GAATGGGGAAGTGTCTAAGGGCGCATGGTGGATGCCTTGGCATCGAGAGCC IIIIGIIIIIGIIIIIIIIIIHHIHGGGBGIIIIIIIIIGFFHIIBEBGD> YT:Z:UP
but after removing rRNA reads, this reads is aligned as shown below
SRR1033691.788517 129 gi|448814763|ref|NC_000962.3| 1473654 23 51M = 3440252 1966649 GAATGGGGAAGTGTCTAAGGGCGCATGGTGGATGCCTTGGCATCGAGAGCC IIIIGIIIIIGIIIIIIIIIIHHIHGGGBGIIIIIIIIIGFFHIIBEBGD> AS:i:-29 XN:i:0 XM:i:5 XO:i:0 XG:i:0 NM:i:5 MD:Z:1T0G1T0T1T43 YS:i:0 YT:Z:DP
Can anyone tell me why or how does this happen?
Which mapper/aligner did you use?
And how did you remove the reads that aligned to the rRNA region(s)?
After this if I check rRNA region, there are still some reads mapped. but the alignment score is not good so can I consider the reads mapped to rRNA region after 4th step?
Aligned using bowtie2