Trying to assign GO and Kegg categories to some ChIP-seq peaks with ChIPseeker and having the following issue:
library("ChIPseeker")
library(TxDb.Mmusculus.UCSC.mm10.knownGene)
library(biomaRt)
library(rtracklayer)
library(org.Mm.eg.db)
peak <- readPeakFile("gains.bed")
peakAnno <- annotatePeak(peak, tssRegion = c(-3000, 3000), TxDb = TxDb.Mmusculus.UCSC.mm10.knownGene, annoDb = "org.Mm.eg.db")
During the annotatePeak step getting this error, not sure what it means exactly:
>> preparing features information... 2017-09-26 10:39:55
>> identifying nearest features... 2017-09-26 10:39:55
>> calculating distance from peak to TSS... 2017-09-26 10:39:55
>> assigning genomic annotation... 2017-09-26 10:39:55
>> adding gene annotation... 2017-09-26 10:40:08
'select()' returned 1:many mapping between keys and columns
>> assigning chromosome lengths 2017-09-26 10:40:08
>> done... 2017-09-26 10:40:08
I guess my issue is that as input I am using an output from DiffBind which does not entirely match the standard BED or narrowPeak formats. It seems what I need to do is retrieve those parts of the DiffBind input narrowPeak files that I get in the output - to maintain that format precisely.
you can also try to get GRanges object using chippeakanno package, then compare its output with the GRanges object generated by chIPseeker. Eventually, annotatePeak will use the GRanges object for peak annotation.