Hello,
I am using GenomicsDBImport, but my input is a file containing a list of GVCF files, and I don't know how to pass it to GATK with the -V
option. I have looked everywhere for this, but I can't find the information in the GATK manual.
gvcf_list=gvcf.listaa
java -jar gatk-package-4.1.4.0-local.jar GenomicsDBImport \
-R HG38/hs38.fa \
--genomicsdb-workspace-path /MY_DATABASE/$newdir \
-V $gvcf_list \
-L resources_broad_hg38_v0_wgs_calling_regions.hg38.interval_list
Thank you in advance
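One possible approach, sketched here under assumptions rather than taken from the GATK manual: GenomicsDBImport accepts -V more than once, so the list file can be expanded into one -V per line before the tool is called. The helper name `with_variants` is hypothetical, and the sketch assumes the list holds one GVCF path per line.

```shell
# Hypothetical helper: run a command with one "-V <path>" pair appended
# for every non-empty line of a list file (assumed: one GVCF path per line).
with_variants() {
    list_file=$1
    shift                      # the remaining arguments are the command to run
    while IFS= read -r gvcf || [ -n "$gvcf" ]; do
        # Skip blank lines; append each path as its own argument
        [ -n "$gvcf" ] && set -- "$@" -V "$gvcf"
    done < "$list_file"
    "$@"
}

# Example call, reusing the paths from the post above:
# with_variants "$gvcf_list" java -jar gatk-package-4.1.4.0-local.jar GenomicsDBImport \
#     -R HG38/hs38.fa \
#     --genomicsdb-workspace-path /MY_DATABASE/$newdir \
#     -L resources_broad_hg38_v0_wgs_calling_regions.hg38.interval_list
```

GenomicsDBImport also has a --sample-name-map argument that takes a tab-delimited file of sample names and GVCF paths, which may be more convenient for long lists.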
Hi Bari.ballew,
We could also do this as:
Either of the scripts gives me output as below:
It is probably my OCD :) and pardon the basic question, but what is bothering me is why the tab spacing is so wide in all samples other than sample_9. I checked by opening the file in vi, and they are all tabs and no spaces; it is just that sample_9 is off! Even with a single-digit number I don't think it should look that way, should it? I am worried that if I don't fix this and feed the file into GenomicsDBImport, I may eventually get an error or a wacky DB, and it would end up being more work. It is just bothering me!! Any input will help!! Thank you
Oops! I posted this in the wrong place; it should come after Bari.ballew's comment. My bad! But you get it ... :)
Sure! As the Perl programmers say, TIMTOWTDI!
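It's because of the different number of characters. sample_9 has one fewer character than the other samples, so it hits the default tab spacing differently: a tab advances to the next tab stop rather than adding a fixed amount of space, so this only affects how the file is displayed.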
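Since the on-screen width of a tab says little about the file's contents, checking the field count per line may be more reassuring than eyeballing the columns. The following is a minimal sketch, assuming the file is a two-column map (sample name, then GVCF path) separated by a single tab; the function name `check_sample_map` is hypothetical.

```shell
# Hypothetical helper: report every line that does not consist of exactly
# two non-empty, tab-separated fields. Exits non-zero if any line is bad.
check_sample_map() {
    awk -F'\t' '
        NF != 2 || $1 == "" || $2 == "" {
            printf "line %d: expected 2 tab-separated fields, found %d\n", NR, NF
            bad = 1
        }
        END { exit bad + 0 }
    ' "$1"
}

# Example call (the file name is an assumption):
# check_sample_map sample_map.txt && echo "sample map looks well-formed"
```

If the check passes, the uneven spacing seen in vi comes only from tab stops and not from stray spaces or extra columns.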