I am not sure whether this is an appropriate question for this board, but if anyone can help, I would be immensely grateful.
I am trying to create a figure for a publication, showing an overview of the bioinformatics steps taken in our study. The idea for this plot came from an answer that I saw here at BioStar, to the question "Best way to visualize Next Generation Sequencing tumor evolution in a graphic?". That plot was generated using ggplot2, I believe.
A bit of background about the information I want to have in this plot: at each step of the bioinformatics pipeline, we have a number of upregulated genes and a number of downregulated genes. From one step to the next, the numbers usually become smaller, but not always. I want to show this with "step number" on the x-axis (step 1, step 2, etc.). I want to divide the y-axis into two: number of upregulated genes on the top half, and number of downregulated genes on the bottom half. The way I envision this plot is with two lines (one for upregulated genes, one for downregulated genes) that smoothly meander from value to value as one moves from left to right, looking much like the plot that I referred to earlier.
I am quite proficient in R but have no experience with ggplot2 yet, although I do have a copy of the ggplot2 book that I have just barely started reading. So far I have not worked out how to execute this plan. Can anyone advise me on how to get started with making this? Preferably not with MS Paint :-)
The plot you are referring to was certainly not made using ggplot2, and if it was, it shouldn't have been. The Grammar of Graphics plotting system is all about expressively combining data and models at plotting time to display data and statistical transformations quickly, together on the same graph. The figure you link to was most likely created in a vector graphics program such as Illustrator or Inkscape (though most biologists prefer to misuse PowerPoint for this). The scatter plots at the right might have been made in ggplot, but appear to be styled more like base graphics in R.
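That said, the simpler plot described in the question (gene counts per step, upregulated above the axis and downregulated below it) is data-driven, so ggplot2 is a reasonable tool for it. Here is a minimal sketch of one way to do it. The counts are made up for illustration, and the names `counts`, `signed`, `smooth_one` and `curves` are my own, not anything from your study. The smoothing uses base R's `spline()` to interpolate between steps; this is purely cosmetic, and a cubic spline can overshoot the real values between steps.

```r
library(ggplot2)

# Hypothetical counts per pipeline step (made-up numbers, for illustration only)
counts <- data.frame(
  step      = rep(1:5, times = 2),
  direction = rep(c("up", "down"), each = 5),
  n_genes   = c(1200, 800, 850, 400, 150,   # upregulated
                 900, 700, 300, 320, 100)   # downregulated
)

# Store downregulated counts as negative values so they fall below the x-axis
counts$signed <- ifelse(counts$direction == "up", counts$n_genes, -counts$n_genes)

# Interpolate a smooth curve through the five points of one direction
smooth_one <- function(d) {
  s <- spline(d$step, d$signed, n = 200)
  data.frame(step = s$x, signed = s$y, direction = d$direction[1])
}
curves <- do.call(rbind, lapply(split(counts, counts$direction), smooth_one))

p <- ggplot(curves, aes(x = step, y = signed, colour = direction)) +
  geom_hline(yintercept = 0) +                 # divider between the two halves
  geom_line() +                                # smoothed curves
  geom_point(data = counts) +                  # the actual counts at each step
  scale_x_continuous(breaks = 1:5, labels = paste("step", 1:5)) +
  scale_y_continuous(labels = abs) +           # show magnitudes on both halves
  labs(x = NULL, y = "Number of genes", colour = NULL) +
  theme_minimal()

print(p)
```

Replacing `geom_line()` with `geom_area(aes(fill = direction), position = "identity")` should give filled shapes instead of lines, if that is closer to the look you are after. For embedded images and free-form annotation you would still export the result (for example as PDF or SVG) and finish it in a vector graphics program.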
I see. I will check out these vector graphics programs and try to make sense of them then. Thanks to both of you for your tips!
What makes you think it was made using ggplot2?
I might be wrong about that (couldn't find a reference in the article itself about how it was produced). A colleague told me that he suspected that it was, based on other plots he had seen in publications, even though he didn't know how to use it himself either...
Do you think it was not ggplot2? If so, any idea how I could make such a plot instead?
Personally I think it was most likely made with a vector graphics program (maybe Inkscape or Illustrator), as it's mostly qualitative (except for the percentages, which could be drawn by measuring) and there are lots of embedded images. edit: Matt below posted the same general idea as I was writing this