HI folks
Sorry for posting a very basic question.
I have data for 2x150 bp from NovoSeq that I would like to do de novo transcriptome assembly. My understanding is that we should avoid hard trimming and implement length filtering. Some studies even say Q5 with 36 bp length cut-off.
I however look around new publications published in 2019 and they are still using Q20. May I have your suggestions what is best? Also the triming length what is the cut-off length I should use since I have longer reads when the previous studies are mostly 100 bp?
Thanks in advance!
My best suggestion is first trim the adapters then use FastQC to look it after excluding the adapters, If the main things like sequence length and and Sequence quality score and N content were fine, you can proceed your analyze. Dont try to Trim too much your data because you will have the false and pointless results.