Hello everyone,
I have a GrangeList
with 222 Granges
objects that looks like that:
GRangesList object of length 222:
$ENCFF005ZCI
GRanges object with 25414 ranges and 2 metadata columns:
seqnames ranges strand | name score
<Rle> <IRanges> <Rle> | <character> <character>
[1] chr1 109100352-109100376 + | DDX21_K562_rep02 200
[2] chr1 109100322-109100351 + | DDX21_K562_rep02 200
[3] chr1 44776511-44776537 + | DDX21_K562_rep02 200
[4] chr1 109100273-109100313 + | DDX21_K562_rep02 200
[5] chr1 75789482-75789548 + | DDX21_K562_rep02 200
... ... ... ... . ... ...
[25410] chrX 73827664-73827696 - | DDX21_K562_rep02 200
[25411] chrX 154784030-154784047 - | DDX21_K562_rep02 200
[25412] chrX 73851381-73851404 - | DDX21_K562_rep02 200
[25413] chrX 53380052-53380072 - | DDX21_K562_rep02 200
[25414] chrY 7324489-7324510 - | DDX21_K562_rep02 200
and I also have a single Granges
object with transcriptome ranges:
> transcriptome
GRanges object with 78807 ranges and 2 metadata columns:
seqnames ranges strand | tx_id tx_name
<Rle> <IRanges> <Rle> | <integer> <character>
[1] chr1 11874-14409 + | 1 uc001aaa.3
[2] chr1 11874-14409 + | 2 uc010nxq.1
[3] chr1 11874-14409 + | 3 uc010nxr.1
[4] chr1 69091-70008 + | 4 uc001aal.1
[5] chr1 321084-321115 + | 5 uc001aaq.2
... ... ... ... . ... ...
[78803] chrY 27605645-27605678 - | 78803 uc004fwx.1
[78804] chrY 27606394-27606421 - | 78804 uc022cpc.1
[78805] chrY 27607404-27607432 - | 78805 uc004fwz.3
[78806] chrY 27635919-27635954 - | 78806 uc022cpd.1
[78807] chrY 59358329-59360854 - | 78807 uc011ncc.1
-------
seqinfo: 93 sequences (1 circular) from hg19 genome
I would like to calculate for every item in the GrangeList
the percentage/fraction of bases covered by the transcriptome Granges
object. I'm aware of the coverage
function in the Granges
package but after much reading I struggle to understand even with a simple example what this function is really doing. Any help is much appreciated, thanks.
yeah I think that makes sense, perhaps I should divide by
sum(width(reduce(transcriptome)))
instead to get the percentage of thegrangelist
covered by thetranscriptome
object.