Problem interpreting 'Per base sequence quality' report
1
0
Entering edit mode
10 months ago
Sanjukta • 0

I am trying to analyse RNA-Seq data on Galaxy. On performing FASTQC before and after trimming with Trimmomatic, my Per base sequence quality report looks link this. What is the interpretation? Can i move forward to the next step? Or this dataset has to be discarded?

enter image description here

Link to the complete reports of the same dataset file, before and after trimming. https://drive.google.com/drive/folders/1BGYwjNCrowIKuhcjhp4yklp1Sgly0j__?usp=sharing

galaxy fastqc • 525 views
ADD COMMENT
1
Entering edit mode
10 months ago

The blue line is the mean quality score which is very good, always around 36. See here for an example report of good sequences that additionally have boxplots: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/good_sequence_short_fastqc.html

Normally you have boxplots, I found an explanation for why your boxplots are missing in the Github issues: if the range is narrow no boxes are drawn. So your data is consistently too good :)

This run looks great to me!

ADD COMMENT
0
Entering edit mode

Thanks a lot

ADD REPLY

Login before adding your answer.

Traffic: 2656 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6