I want to know the standard quality score and sequence length cutoffs for trimming with fastx_toolkit.
In my RNA-Seq textbook, the author trims bases whose quality scores are below 20 and discards reads whose length falls below 30 bp after trimming (the original length is 100 bp). Is that a standard value, or does it change case by case?
If there is a standard quality score or sequence length cutoff that many biologists adopt, please tell me what it is. If it changes case by case, please tell me how biologists judge and decide on the values.
There is no standard for these things, since every dataset is different. You will need to experiment and see what your dataset looks like in QC. If you have a good reference to align to, you may be able to use data down to Q15.
If possible, avoid using fastx_toolkit, which is old by NGS standards. bbduk.sh from the BBMap suite, trimmomatic, or cutadapt are all great alternatives.
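To make the "experiment with your own data" advice concrete, here is a minimal Python sketch of how the choice of quality cutoff changes the number of reads that survive a length filter. It is only an illustration under stated assumptions: qualities are assumed to be Phred+33 encoded, trimming is a naive 3'-end trim (not the algorithm used by any of the tools named above), and the function names and toy reads are made up for this example.

```python
def phred_scores(quality_string, offset=33):
    """Decode a FASTQ quality string into integer Phred scores (Phred+33 assumed)."""
    return [ord(ch) - offset for ch in quality_string]


def trim_3prime(sequence, quality_string, min_quality=20, offset=33):
    """Drop bases from the 3' end until a base with quality >= min_quality is reached."""
    scores = phred_scores(quality_string, offset)
    end = len(scores)
    while end > 0 and scores[end - 1] < min_quality:
        end -= 1
    return sequence[:end], quality_string[:end]


def surviving_fraction(reads, min_quality, min_length):
    """Fraction of reads that are still at least min_length long after 3' trimming."""
    if not reads:
        return 0.0
    kept = 0
    for sequence, quality_string in reads:
        trimmed_seq, _ = trim_3prime(sequence, quality_string, min_quality)
        if len(trimmed_seq) >= min_length:
            kept += 1
    return kept / len(reads)


# Toy reads, invented for illustration: (sequence, quality string).
# In Phred+33, 'I' is Q40, '5' is Q20 and '0' is Q15.
toy_reads = [
    ("ACGTACGTAC", "IIIIIIIIII"),  # high quality throughout
    ("ACGTACGTAC", "IIIIIIII00"),  # last two bases are Q15
    ("ACGTACGTAC", "IIIIIIII55"),  # last two bases are Q20
]

# Compare how many reads survive at a few candidate cutoffs.
for cutoff in (15, 20, 30):
    fraction = surviving_fraction(toy_reads, min_quality=cutoff, min_length=9)
    print(f"Q{cutoff}: {fraction:.2f} of reads kept")
```

On real data you would run the same kind of comparison with the trimming tool you actually use, and check how the surviving read count and the downstream alignment rate change with each cutoff.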