How To Get Strand Information For List Of Intervals In .Bed File (To Use With Picard Calculatehsmetrics)
1
1
Entering edit mode
11.0 years ago
User6891 ▴ 330

Hi everyone,

I want to use Picard's CalculateHsMetrics tool. To do this one needs a BAIT_INTERVAL & TARGET_INTERVAL file, which both need to contain 5 columns (chromosome, start, end, strand & name). I want to test this on whole exome data. The company has a .bed file with the target (Nextera Exome), but this file does not contain information about the strand. Since this is mandatory for Picard, I was wondering if someone knows how to get this information? Is Picard actually using this information to calculate the metrics?

strand bed picard • 4.8k views
ADD COMMENT
2
Entering edit mode
11.0 years ago
brentp 24k

I usually just use "+" for all of them when I dont care. For capture, my regions look like (for columns chrom, start, end, strand, name):

chr1    3534355 3534595 +       chr1:3534355-3534595
chr1    3593248 3593368 +       chr1:3593248-3593368
chr1    3611230 3611590 +       chr1:3611230-3611590
chr1    3670534 3671089 +       chr1:3670534-3671089
chr1    3671123 3671483 +       chr1:3671123-3671483
chr1    3671632 3672740 +       chr1:3671632-3672740
ADD COMMENT
0
Entering edit mode

That's what I was planning to do too, but I want to be sure that picard HsMetrics is not using this strand information for some calculations

ADD REPLY
0
Entering edit mode

For most metrics it does not. The CalculateHSMetrics tool is great, but is also very finicky and gives strange error messages. I've run this tool a lot so post again if you run into any problems.

ADD REPLY

Login before adding your answer.

Traffic: 2350 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6