Question

How to Make Dot Plots of GO Terms?

2

Entering edit mode

4.9 years ago

cthangav ▴ 110

I'm looking for a way to make a dot plot like this with my own GO terms that I got using a program called HOMER. Does anyone know of any good tutorials/ have any advice on how to make a dot plot graph like the one below with your own GO terms in R? Chipseeker uses its own GO analysis to generate these plots. https://hbctraining.github.io/Intro-to-ChIPseq/img/compareCluster.png

R dot plot gene ontology • 32k views

ADD COMMENT • link updated 3.4 years ago by azad.mahnaz228 • 0 • written 4.9 years ago by cthangav ▴ 110

1

Entering edit mode

Clusterprofiler does that

ADD REPLY • link 4.9 years ago by zizigolu ★ 4.4k

0

Entering edit mode

4.9 years ago

Shalu Jhanwar ▴ 540

Have a look at dot plot with ggplot

ADD COMMENT • link 4.9 years ago by Shalu Jhanwar ▴ 540

0

Entering edit mode

To make the plot the original poster wants in ggplot2 it would be geom_point, not geom_dotplot. The bioconductor library clusterProfiler has a handy convenience function to make these plots quickly too.

ADD REPLY • link 4.9 years ago by rpolicastro 13k

0

Entering edit mode

I am looking for a way to make dot plot for gene onthology enrichment analysis using clusterProfiler, but my organism is rice (oryza sativa) so "DOSE" package doesn't work in my case. do you have any suggestion?

ADD REPLY • link 3.4 years ago by azad.mahnaz228 • 0

score 15 · Accepted Answer · 2020-08-19

Hi,

If you have all the data with all the information that you need to make the plot, you can easily do it in R, with ggplot2.

# import package
library("ggplot2")

# create fake data
set.seed(1024) # keep reproducibility
go <- paste0("GO", sample(1000:2000, 5))
data <- data.frame("GOs" = rep(go, 2), 
                   "Condition" = rep(c("A", "B"), each = 5),
                   "GeneRatio" = 1 / sample(10, 10), 
                   "p.adjust" = 0.05 / sample(10, 10))

# plot: dot plot
ggplot(data = data, aes(x = Condition, y = GOs, 
                        color = `p.adjust`, size = GeneRatio)) + 
  geom_point() +
  scale_color_gradient(low = "red", high = "blue") +
  theme_bw() + 
  ylab("") + 
  xlab("") + 
  ggtitle("GO enrichment analysis")

In this case this fake data looks like:

      GOs Condition GeneRatio    p.adjust
   1  GO1980         A 0.1250000 0.050000000
   2  GO1213         A 0.1428571 0.025000000
   3  GO1308         A 0.1000000 0.010000000
   4  GO1396         A 0.2500000 0.006250000
   5  GO1351         A 1.0000000 0.008333333
   6  GO1980         B 0.3333333 0.007142857
   7  GO1213         B 0.1111111 0.005000000
   8  GO1308         B 0.1666667 0.016666667
   9  GO1396         B 0.2000000 0.005555556
   10 GO1351         B 0.5000000 0.012500000

And the plot like this:

enter image description here

The p.adjust usually is transformed into -log10(p.adjust) because it provides a better discrimination scale.

I can help a bit more if you provide the structure of your data.

I hope this helps,

António