Hey everyone,
I downloaded the Rna seq data from the organism of interest and my goal is to produce a de novo transcriptome assembly. When I ran fastqc on the raw reads I get a warning or failure on the "per base sequence content" module(the problem is detected at the first 15 bases
AND I get checkmark on the adapter content..I could use trimmomatic with the headcrop
parameter, but I think that's not an efficient way(too much info lost). Can you suggest me an efficient way of getting the checkmark on this module? (without cropping all the reads)
Thank you
Also to note is that the values in the first ten positions are not binned, after which they become an average of 2 positions.
This is a common case. I would trim 5` region and map them if alignments are improved after trimming.