Question

Unusual FastQC sequence distribution from small RNA seq

0

Entering edit mode

3.2 years ago

Christy ▴ 20

Hi,

I am attempting to analyse some small RNA sequencing data produced using an Illumina TruSeq Small RNA Library Preparation Kit. The RNA was isolated from sheep serum. Pre-sequencing QC was fine and post sequencing looked good too - apart from unusual sequence length distribution. We were mostly expecting miRNAs and so peaks at ~20-25nt but instead we have peaks at 44bp and 60-62bp:

post-trimming sequence length distribution

Additionally when the tool miRDeep2 was used to map these to the Ovis aries genome, we are getting extremely low mapping rates (sub .1%). I also tried blasting some of the sequences and was getting no results. Is this kind of sequence length distribution telling of any particular issue? We thought it may have been poor trimming but there appears to be no adapter contamination. Any advice is much appreciated!

Many thanks, CW.

mapping rna-seq • 1.9k views

ADD COMMENT • link updated 22 months ago by Ram 44k • written 3.2 years ago by Christy ▴ 20

0

Entering edit mode

Hello,

does this kit includes enrichment steps and adds unique molecular identifier (UMI) to the fragments or something similar?

ADD REPLY • link 3.2 years ago by Olli • 0

0

Entering edit mode

Hi Olli,

We didn't use the kit ourselves it was undertaken by the university core genomics unit but I don't believe there was UMI added - I will ask the core genomics unit to confirm if this was the case.

ADD REPLY • link 3.2 years ago by Christy ▴ 20

0

Entering edit mode

Did you look for presence of specific small RNA adapter sequence for TrueSeq kit (TGGAATTCTCGGGTGCCAAGG) first? Any reads that do not have this adapter are likely not useful/usable. Once you find the reads that contain this sequence you will need to trim the adapters and then use the remaining read to align using an ungapped alignment.

ADD REPLY • link 3.2 years ago by GenoMax 147k

0

Entering edit mode

Did you resolve your problem? I have this profile and I don't know if le length corresponds to miRNA + the sequence adapter. Does anybody knows?

enter image description here

ADD REPLY • link updated 22 months ago by Ram 44k • written 22 months ago by correo.jenny.gm • 0

0

Entering edit mode

Answer is probably. Check which kit was used for the library prep and then trim the data.

ADD REPLY • link 22 months ago by GenoMax 147k

score 0 · Answer 1 · 2023-01-23

0

Entering edit mode

22 months ago

Barry Digby ★ 1.3k

for file in ${DATA}/*; do

    trim_galore \
        --adapter TGGAATTCTCGGGTGCCAAGG \
        --length 17 \
        --clip_r1 4 \
        --three_prime_clip_r1 4 \
        --max_length 30 \
        --gzip \
        --fastqc \
        $file

done

ADD COMMENT • link 22 months ago by Barry Digby ★ 1.3k