Hi Everyone,
I am working on Whole Genome Sequencing and analysis of Human genome from illumina HiSeq platform with about 30X coverage. Each sample (human genome) have about 250-300 fastq.gz files, whom I am dealing with 'fastqc' for quality check using following command :
/usr/local/bin/fastqc -t 8 -f fastq -o OUT/ -casava *.gz -noextract
Although it is running fine and generating equal number of "fastqc.zip" files which I unzipped using unzip '*.zip'. So, here I have 2 questions:
- Can I merge two or more fastq files and then run fastqc on those merged files? If yes, how should I merge those fastq files?
- I have to manually check 250-300 fastqc folder to know the quality by opening .html page. Is there any way where I can have summary of overall quality of the fastq files in a flowcell?
Please let me know your comments. I'll be highly thankful to you.
Best,
Ravi
Visiting from the future... If anyone is dealing with this issue, they may also want to check out MultiQC.
@Madelaine Gogol Just what I was needed!
@Madelaine Gogol,
I have 10 fastq.gz files (including R2 reads). I made a sample_name.txt with name of fastq.gz files (10 files) and ran fastqc.pl by the following command.
fastqc.pl -name sample_Name.txt
. But I have got null out put. Please can you help me to run your programI don't usually run it with that option, but it looks like it's expecting sampleName[tab]adapter sequence in the file. That is just to get the names of the sample. You would still have to pass in the fastq files as an argument. like --files *.fastq.gz (if you were in the same directory).
@Madelaine Gogol, since a long time we are looking for such a nice solution to merge all the fastqc reports in a single html file. However the script runs only on the first file in the folder and stops then. Do you have an idea why? We are using following command:
Not really... What is the format of the "names" file? Did you make any changes to the fastqc script besides changing the name? Maybe try it with less arguments at first to see if it runs that way - like from inside the directory of fastqs with just --files "*.fastq".