Unusual FastQC sequence distribution from small RNA seq
1
0
Entering edit mode
3.2 years ago
Christy ▴ 20

Hi,

I am attempting to analyse some small RNA sequencing data produced using an Illumina TruSeq Small RNA Library Preparation Kit. The RNA was isolated from sheep serum. Pre-sequencing QC was fine and post-sequencing QC looked good too, apart from an unusual sequence length distribution. We were mostly expecting miRNAs, and so peaks at ~20-25 nt, but instead we have peaks at 44 bp and 60-62 bp:

[image: post-trimming sequence length distribution]

Additionally, when miRDeep2 was used to map these reads to the Ovis aries genome, we got extremely low mapping rates (below 0.1%). I also tried BLASTing some of the sequences and got no results. Is this kind of sequence length distribution indicative of any particular issue? We thought it might be poor trimming, but there appears to be no adapter contamination. Any advice is much appreciated!
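To check the length distribution independently of the FastQC plot, the read lengths can be tabulated straight from the FASTQ file. The following is a minimal sketch, assuming standard 4-line FASTQ records; the function name `read_length_hist` and the file name in the usage comment are hypothetical and not from the original post.

```shell
# Print "length<TAB>count" for every read length found in a FASTQ file.
# Assumes 4-line FASTQ records; accepts plain or gzip-compressed input.
# Usage (hypothetical file name): read_length_hist sample_trimmed.fq.gz
read_length_hist() {
    case "$1" in
        *.gz) gzip -cd -- "$1" ;;   # decompress gzip input to stdout
        *)    cat -- "$1" ;;        # plain-text FASTQ
    esac |
        awk 'NR % 4 == 2 { n[length($0)]++ }   # line 2 of each record is the sequence
             END { for (len in n) printf "%d\t%d\n", len, n[len] }' |
        sort -n
}
```

Running this on both the raw and the trimmed files shows whether the 44 bp and 60-62 bp peaks are already present before trimming or only appear afterwards.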

Many thanks, CW.

mapping rna-seq • 1.9k views

Hello,

Does this kit include enrichment steps and add unique molecular identifiers (UMIs) or something similar to the fragments?


Hi Olli,

We didn't use the kit ourselves; the library prep was undertaken by the university core genomics unit, but I don't believe UMIs were added. I will ask the core genomics unit to confirm.


Did you first look for the small RNA adapter sequence specific to the TruSeq kit (TGGAATTCTCGGGTGCCAAGG)? Any reads that do not contain this adapter are likely not usable. Once you have found the reads that contain this sequence, you will need to trim the adapter and then align the remaining part of each read using an ungapped alignment.
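The adapter check described above can be sketched in a few lines of shell. This is only an illustration, assuming 4-line FASTQ records and exact matches to the full adapter; reads that carry a truncated adapter at the 3' end, or an adapter with sequencing errors, will not be counted. The function name `count_adapter_reads` is hypothetical.

```shell
# Print "reads_with_adapter<TAB>total_reads" for a FASTQ file.
# Only exact, full-length adapter matches are counted (an assumption of this sketch).
# Usage (hypothetical file name): count_adapter_reads sample.fastq.gz [ADAPTER]
count_adapter_reads() {
    adapter=${2:-TGGAATTCTCGGGTGCCAAGG}   # TruSeq small RNA 3' adapter quoted in the thread
    case "$1" in
        *.gz) gzip -cd -- "$1" ;;
        *)    cat -- "$1" ;;
    esac |
        awk -v a="$adapter" '
            NR % 4 == 2 { total++; if (index($0, a) > 0) hits++ }
            END { printf "%d\t%d\n", hits, total }'
}
```

If only a small fraction of the raw reads contain the adapter, the library probably holds few genuine small RNA inserts, which would be consistent with the low mapping rate reported in the question.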


Did you resolve your problem? I have this profile and I don't know if the length corresponds to miRNA + the adapter sequence. Does anybody know?

[image: sequence length profile]


The answer is probably yes. Check which kit was used for the library prep and then trim the data accordingly.

23 months ago
Barry Digby ★ 1.3k
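# Trim the TruSeq small RNA 3' adapter from every FASTQ file in ${DATA}.
# Reads shorter than 17 nt or longer than 30 nt after trimming are discarded;
# 4 bases are also clipped from the 5' end and from the 3' end of each read.
for file in "${DATA}"/*; do

    trim_galore \
        --adapter TGGAATTCTCGGGTGCCAAGG \
        --length 17 \
        --clip_r1 4 \
        --three_prime_clip_r1 4 \
        --max_length 30 \
        --gzip \
        --fastqc \
        "$file"

done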