Quantifying miRNAs from a ribosome-depleted whole transcriptome library can be challenging because traditional library prep methods aren't optimized for small RNAs. Here are some suggestions to improve alignment and quantification:
Read Length and Trimming:
Since miRNAs are ~20-24 nt, aligning 100 bp reads directly to mature miRNA sequences can lead to poor results. Use tools like Cutadapt or Trimmomatic to trim adapter sequences and shorten your reads to match miRNA lengths before alignment.
Aligning with Bowtie2:
Bowtie2 is well-suited for short reads. When aligning to the mature.fa file from miRBase, try setting parameters for very short, exact matches:
bowtie2 -x miRBase_index -U trimmed_reads.fastq -L 15 -N 0 --n-ceil L,0,0.15 -k 1 -S miRNA_alignment.sam
The -L 15 parameter adjusts the seed length, and -N 0 allows no mismatches in the seed.
Using Rsubread:
Rsubread isn’t typically optimized for small RNA alignment, but if you prefer it, ensure your reads are trimmed to ~20-24 nt before aligning. Also, consider setting a smaller fragment length and allowing more mismatches in your alignment parameters.
rRNA Depletion Optimization:
Residual ribosomal RNA can impact the detection of low-abundance miRNAs. To improve your rRNA depletion step in future experiments, consider using Zymo Research’s PureRec Duplex-Specific Nuclease (DSN). PureRec DSN efficiently removes rRNA and highly abundant transcripts, enhancing the detection of miRNAs and other low-expression targets, resulting in cleaner, more informative libraries.
Quantification Tools:
After alignment, use tools like miRDeep2, mirge2.0, or featureCounts (if using Rsubread) to quantify miRNA expression accurately.
By optimizing trimming, alignment, and rRNA depletion with PureRec DSN, you’ll likely improve your miRNA detection from whole transcriptome libraries. Hope this helps point you in the right direction!
Thanks for the reply!
All the information I have is the following, but I guess that wouldn't be sufficient?
The libraries were prepared using out whole-transcriptome CORALL v2 kit with ribodepletion (RiboCop). The read length was SR100.
TruSeq adapters are used in the libraries: Read 1: AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC
Confirm this is the kit before proceeding.
https://www.lexogen.com/wp-content/uploads/2023/07/171UG394V0111_CORALL-RNA-Seq-V2-with-UDIs_2023-07-12.pdf on page 36 has instructions for data analysis.
This appears to be a total-RNA kit so you will have things other then miRNA (unless something special was done).