Hello guys, I need help with processing gnomad variant data.
I downloaded reference bgz file ( gnomad.exomes.r2.0.2.sites.vcf.bgz ) from here http://gnomad.broadinstitute.org/downloads
But I need to uncompress this bgz file into regular vcf - I need to process it locally variant by variant. Is there any way how to do it?
I tried to decompress the file with bgzip but I had this error:
bgzip --decompress gnomad.exomes.r2.0.2.sites.vcf.bgz
[bgzip] gnomad.exomes.r2.0.2.sites.vcf.bgz.tbi: unknown suffix -- ignored
Is there any way how to get plain vcf from bgz file?
Thank you in advance
Hello,
what kind of analysis do you like to perform?
Normaly there is no need to unzip. Most existing software for anslyising vcf can handle compressed file. If you write your own one, there is a good chance that there is a module which can handle it.
One more thing you can do is to use
zcat
. This prints the content of a compressed file directly to stdout and you can read it in on the fly to your program.fin swimmer