Parse FASTQC data
2
2
Entering edit mode
6.9 years ago
win ▴ 990

Hi all, I have a question regarding FASTQC . When we run FASTQC on FASTQ file it generates an HTML report. Is there any tool out there that can read FASTQC data, parse it and provide it in raw form. I need this so that I could use the values in an algorithm to determine if the data is of good quality.

Any help will be highly appreciated.

NGS • 4.5k views
ADD COMMENT
0
Entering edit mode

isn't there usually a txt file associated with this html page ? e.g: https://github.com/ChillarAnand/fadapa/blob/master/tests/fastqc_data.txt

ADD REPLY
0
Entering edit mode

If I remember correctly, it is printed out only if you specify an --outdir.

ADD REPLY
1
Entering edit mode
6.9 years ago
James Ashmore ★ 3.5k

By default FastQC should generate a zip folder with files containing the data used for plotting.

ADD COMMENT
0
Entering edit mode

This is the right answer .. at least when fastq is run on the command line (e.g. fastqc file.fq.gz file2.fq.fz). I don't recollect if you run fastqc using the GUI this works.

ADD REPLY
0
Entering edit mode
6.9 years ago

Specify an --outdir and all the output files will be placed there, including a txt report that you can easily parse, and the html file that you can open to see the results in a graphical-friendly format. The two files contain the same information.

ADD COMMENT
0
Entering edit mode

there is now a tool available to parse the output of fastqc into a python dictionary, which can then be converted into pandas dataframes for downstream aggregation and querying

https://pypi.org/project/fastqcparser/

ADD REPLY

Login before adding your answer.

Traffic: 2194 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6