Does anyone know where to get intervals files for Illumina WES? I was only able to find this.
For reference, I want to run coverage/depth analysis for one of the BAM files that I have. I tried looking into what preparation went into sequencing based on the BAM header but only got ID:DS-bkm-112-N_L001_RG SM:DS-bkm-112-N_L001 LB:lib_name PL:illumina
.
I think I'm able to run coverage analysis without the intervals file but it would be whole genome analysis, if I'm not mistaken. Don't think this would be very accurate but better than nothing?
That BAM header does not tell you what WES kit was used. It just says that the sequencing platform was Illumina. Illumina makes their own WES kits, but there are many others.
I am aware, but was hoping I was wrong...so the best I can do is just run a whole-genome based coverage like GATK's DepthOfCoverage?
Whole-genome coverage probably would not make sense. You are only expecting to capture about 2% of the genome with WES. However, you can use all known exons as your intervals. That would not be perfect, but it would be much closer to the truth.
Do you know where I could get such an interval file?
Also, if I did whole-genome coverage, wouldn't that just be the same as WES but with many more blank intervals? The aggregated interval statistics would probably be off, but for the relevant intervals, would they be more or less accurate?
You should find out from the originators of data which kits was used or did you download this data from SRA?
No I received this through many different people and it is difficult to contact the originators
You would still have to define "relevant intervals" somehow.
Wouldn't I just look for intervals that have coverage? Wouldn't those be the "relevant intervals"?
If you look for regions that have coverage, they will be covered.
Usually, the main reason to check for coverage is to see how many regions of interest are covered.