Question

Fastqc And Require Tools

5

Entering edit mode

13.4 years ago

Ric ▴ 440

Hello,

FastQC ( http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/ ) provide an overview what is wrong with a Fastq file.

However, it does not provide any tools to fix the problems. What kind of tools have to be used to fix the problems?

Thank you in advance.

fastqc illumina next-gen sequencing sequence quality • 7.9k views

ADD COMMENT • link updated 11.7 years ago by Phil S. ▴ 700 • written 13.4 years ago by Ric ▴ 440

score 9 · Answer 1 · 2012-02-13

FastQC is just an informative tool, very useful in my opinion to evaluate how the lab sequencing jobs have performed before going into mapping. but as any informative tool, it just tells you what's going on down there, and of course you are the one ultimately having to curate those reads in case you need to. but you'll have to know that there are things that may be curated, and there are others that you won't be able to do anything with them.

since the available tests performed by FastQC are multiple, there isn't a particular to fix the reported errors if any. things like the "per base sequence quality" for instance may be solved by simple sequence trimming if the reported error is that the last 3 bases are always wrong, so they should always be removed from posterior analysis. other error reports may be also solvable by simple scripting, but most of them, such as "per sequence GC contect", are purely descriptive, and in case they fail to pass the FastQC thresholds there's nothing you can do about them but to repeat the lab sequencing if wanted.

score 4 · Answer 2 · 2012-02-13

4

Entering edit mode

13.4 years ago

Neilfws 49k

The FASTX toolkit may help in some cases; there are tools to filter, trim and clip low-quality sequence. There are also some tools around to "heal" sequencing reads, which may be able to correct platform-dependent systematic errors.

However, as Jorge says, QC generally tells you whether or not something quite fundamental is wrong at the experimental level. If it is, you can't always just "fix" it computationally.

ADD COMMENT • link 13.4 years ago by Neilfws 49k

0

Entering edit mode

+1 here. I guess I focused too much in how I think FastQC results should be taken into account, rather than in answering the question itself. yes, FASTX toolkit would be the approach I too would recommend for anyone willing to play around with fastq file, plus its integration in Galaxy may reduce the slope of its learning curve.

ADD REPLY • link 13.4 years ago by Jorge Amigo 14k

score 2 · Answer 3 · 2013-11-13

2

Entering edit mode

11.7 years ago

optimuscoprime ▴ 140

You could try https://github.com/optimuscoprime/autoadapt

It will remove contaminant adaptors and primers, as identified by FastQC, as well as removing low quality sequences

ADD COMMENT • link 11.7 years ago by optimuscoprime ▴ 140

0

Entering edit mode

it would be nice if you could give some support on using autoadapt on issues raised in github and also via email. Thank you

ADD REPLY • link 10.2 years ago by eva ▴ 20

score 1 · Answer 4 · 2013-11-14

1

Entering edit mode

11.7 years ago

Phil S. ▴ 700

For Illumina Sequencing i would suggest Trimmomatic, it is kind of designed for illumina and has more specific options. However, fastX is very useful!

ADD COMMENT • link 11.7 years ago by Phil S. ▴ 700