Hello,
I'm trying to run Hisat2 on 2 fastq files which are currently fastq.gz as downloaded from the servers of the people that did the sequencing for us. When I try to run it though, even though my code specifies fastq.gz, I keep getting the error that the files are not gziped.
This is the code im using :
hisat2 -p 7 -x -known-splicesite-infile /usr/local/src/gtf.gtf mm10idx -1 WTCHG_284763_229_1.fastq.gz -2 WTCHG_284763_229_2.fastq.gz -S eg2.bam
Any ideas? thanks
What do you get with these two commands
file WTCHG_284763_229_1.fastq.gz
orzmore WTCHG_284763_229_1.fastq.gz
(ctrl+c
to return to system prompt)?it outputs this
is that good or bad? :/
What do you get for
file WTCHG_284763_229_1.fastq.gz
?Based on your answer below it looks like these are compressed files but they may have been corrupted when you downloaded them. Can you try redownloading them? Did you change the file name in any way during the download?
file outputs this:
I've downloaded the files using filezilla which is usually fine but I'll try re-downloading from the server.
That would the the thing to try. I would suggest downloading the file using
sftp
directly on the server and taking outfilezilla
(as long as your server has a direct internet link).I've checked the md5sum of the files and they don't match so something is going wrong with the files while they're being downloaded. Tried sftp and filezilla and both have mismatches. Any suggestions for alternatives? :/ Usually I've been using cyberduck but it completely doesn't work lately.
Found a questionable solution to this online and tried it anyway and it worked.
Apparently my firewall must be doing something to the files. Switched the firewall off and then downloaded the files and everything is ok! md5sums etc
Firewall on Win 10 or virtual box? It is interesting that the file contents are mangled by the firewall rather than just preventing you from connecting/downloading.
Firewall on both, downloaded via ubuntu - mangled, downloaded via windows 10 - mangled again. But after switching the firewall off and downloading via filezilla (in windows) everything worked fine.
Hisat2 is working fine on them aswell so no idea why the firewall being up would do that to the files. First time its happened because I've been using this machine/OS for over a year. Maybe it was something they changed in the recent update to windows10...