Hi!
I'm looking for variants that 5% of all VCF files share. The total number of VCF files is 2500 (i.e. WGS data from 2500 individuals).
So I tried bedtools and bcftools. Both have an "intersection" function, and I think bcftools isec is the best fit for me.
So I ran bcftools:
bcftools isec -n-24 -c all -p path/to/dir ${VCF_LIST}
-n-24: "variants present in 24 or fewer files among the 2500 files"
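As a quick sanity check on the threshold: 24 of 2500 files is just under 1%, while 5% of 2500 files is 125. A minimal sketch of the arithmetic, assuming the goal really is a 5% cutoff (the helper name pct_count is made up for illustration):

```shell
# pct_count TOTAL PERCENT: print how many files PERCENT of TOTAL corresponds to
# (integer arithmetic, rounded down). Hypothetical helper, not part of bcftools.
pct_count() {
    echo $(( $1 * $2 / 100 ))
}

pct_count 2500 5   # number of files that make up 5% of 2500
```

The result can then be passed to -n with the prefix that matches the intent: "-" for this many or fewer files, "+" for this many or more, "=" for exactly this many.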
But an error occurred: "Could not load the index".
I think this error is caused by the Linux open-file limit (ulimit -u: 4096 [max user processes]; ulimit -n: 385778 [open files]).
How can I fix it?
Could somebody help me, please?
Thank you all
All VCF files must be indexed with bcftools before running isec.
But later, you might still have some problems with opening 2500 files...
unless the error was "bcftools cannot load the index because too many files were open".
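A minimal sketch of that indexing step, assuming the files are bgzip-compressed, named *.vcf.gz, and sit in one directory (the function name index_missing and the directory layout are assumptions, not from the original post). It only indexes files that have no .tbi or .csi index yet:

```shell
# index_missing DIR: run `bcftools index` on every *.vcf.gz in DIR
# that does not already have a .tbi or .csi index next to it.
# Assumes the VCFs are bgzip-compressed; plain gzip will not work for indexing.
index_missing() {
    dir=$1
    for vcf in "$dir"/*.vcf.gz; do
        [ -e "$vcf" ] || continue          # skip if the glob matched nothing
        if [ ! -e "$vcf.tbi" ] && [ ! -e "$vcf.csi" ]; then
            bcftools index --tbi "$vcf"    # writes "$vcf.tbi"
        fi
    done
}
```

Usage would be something like index_missing path/to/vcfs before calling bcftools isec on the same files.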
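To check whether the open-file limit is even a plausible cause, the number of input files can be compared against ulimit -n. This is a rough sketch only: the estimate of two descriptors per VCF (data file plus index) and the slack of 16 are assumptions, and fits_open_file_limit is a made-up helper name.

```shell
# fits_open_file_limit N_FILES [LIMIT]: succeed if N_FILES VCFs plausibly fit
# under the open-file limit. LIMIT defaults to the current soft limit.
fits_open_file_limit() {
    n_files=$1
    limit=${2:-$(ulimit -n)}
    [ "$limit" = "unlimited" ] && return 0
    # Assumption: one descriptor for the VCF, one for its index, plus some slack.
    needed=$(( n_files * 2 + 16 ))
    [ "$needed" -le "$limit" ]
}

if fits_open_file_limit 2500; then
    echo "open-file limit looks sufficient for 2500 files"
else
    echo "open-file limit may be too low for 2500 files"
fi
```

With the limit quoted in the question (ulimit -n of 385778), this check passes for 2500 files, which would point towards missing index files rather than the file limit.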