GIAB Benchmark (High Confidence) Bed Filles
0
1
Entering edit mode
3.2 years ago

Hi all,

I havent used Genome in a Bottle for a couple of years. When I did use it, I recall I would download samples in VCF format for:

  1. AshkenaziTrio (three each)
  2. NA12878 (only one)
  3. ChineseTrio (three each)

I would then download what used to be called a high-confidence region bed file and intersect each of the VCF files against this. For some reason, it seems like each of these associated files are already labeled as "benchmarked."

Can anybody help me understand what's up now and if I'm misremembering how I went about this a couple of years ago.

Thanks!

SNPS GIAB variants • 1.7k views
ADD COMMENT
0
Entering edit mode

There are different releases and the older ones are still named as high-confidence, while newer releases are labeled as benchmarked. The vcf and bed files are intended to be used in conjunction to benchmark accuracy of small variant calls.

You can browse the different releases through the ftp page and the readme file can help with more information

Hope this helps!

ADD REPLY
0
Entering edit mode

Thanks, that helped a lot! Another question; I think I'll just stick to using the older high-confidence bed files for now. Is there only a single common high confidence bed file, or does every group I listed above have their own bed file?

ADD REPLY
1
Entering edit mode

Each group listed above have their own bed file.

ADD REPLY

Login before adding your answer.

Traffic: 2586 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6