Hey colleagues,
Summary some good habit in our research. I have been hit by the project badly since some bad habit, such as:
- Record everything in a project in one systemic page, such as
Wiki
orEvernote
, so that you can check them easily. Never try to remember everything if you put them everywhere. - Save all the data which you were used to make the figure, since sometimes boxplot will be change to
violin plot
or heatmap plot orbee swarm plot
. You will never know which is the prefer for your boss or reviewer. If you don’t save them, maybe you need to re-built the data again. - Keep the figure as
PDF
forever, you know, JPEG, TIFF, PNG is not what you need in the publication. - Use Adobe illustrator, Never Never Never use Photoshop.
- Learn to use ggplot2, it would be more fast to prepare Figures if you master it compared with R plot.
- Build your own function (Perl, R, Python)
library/packages
. Compile and Use them for next time. Don't write them again and again. - Upload the code to
github
orgitlab
, share with yourself and others. - record all the method, idea, process, procedure and pipelines in
mediawiki
and shared with your lab-mates - Save the fastq to SRA/GEO or wig to UCSC so that we don't need spend extra money after we complete the project
- The code or script by non-professional stuff/student would be horrible, Majority of them will have some bugs, be careful, asking help for
code review
from colleagues would be good habit. - how to
prepare your manuscript
and the efficiency: link: the best habit to prepare manuscript - Time
Management Strategies
and Advice for Bioinformaticians: Link here - Build your own bioinformatics server and assemble all the platform your need and your own pipeline.
- Arial for font in the Fiugre, never use red-green combination, never use rainbow color scale, Font size:8pt
- Never never make your script running for 12 hours (especially in PBS), split them into many pieces within 2 hours. You boss will be in the trouble if you meet bugs for several times.
- try to use
Anaconda
data science platform and assemble the tools what you prefer as a uniform platform. fork
and help to make your frequent software more powerful ingithub
- check the positive and negative control for each computational analysis, so that find all bugs in the beginning.
- maintain your blog/make md5sum label for each your own database
More suggestions?
Curious why you dislike Photoshop? I do most of my figure creation in GIMP, so it isn't vector based like AI is. But i've never had any problems with it.
Photoshop is really only appropriate for editing images of gels, or things like that. For generating or editing other types of plots, which should be scalable and vector-based, Illustrator (or Inkscape, etc) is the right tool.
Yep. I generate almost all my plots in R, including very complex ones. But for final polishing, required figure dimensions, dpi, color profile - I also use Photoshop.
Isn't Illustrator better for editing vector graphics like PDF? Just curious why Photoshop...
Photoshop and Illustrator are both $29.99 a month. Meanwhile, e.g. GIMP and ImageMagick are FOSS.
and Inkscape as a direct Illustrator alternative!
If you're a student, it's 20 bucks for everything :)
I've been on that deal for what seems like most of my adult life :P As great as GIMP and ImageMagick are (and ImageMagick is particularly good with the command + extensions like montage), once you learn where everything is in Photoshop and Illustrator, there's really no competition. I mean, GIMP and IM are really good considering they're totally free - but I think you get what you pay for with Adobe's Creative Cloud. You even get cloud storage and some other perks (like your username/password dumped online every now and again...heh).
But the best thing about going Adobe is that there are online guides/tutorials for just about everything. I had a particularly tricky issue the other day involving intersecting two SVG heatmaps, which i could 'solve' in Illustrator in about 10 minutes thanks to a guide someone made in 2001 :P