DEA.R
I wrote a command-line R script to perform differential expression analysis:
- Conveniently
- Quickly
- Reproducibly
The script uses the R packages DESeq2, edgeR and limma-voom. It's available on GitHub. There is still room for improvement, so if you are interested in contributing, I'd definitely welcome that.
Features
- Performs counting using featureCounts
- Allows specification of covariates
- Rigorous checking of input data
- Creates various plots
- Creates detailed tables and lists of differentially expressed genes
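To give an idea of what input checking with covariates can look like, here is a minimal base R sketch. It is not taken from DEA.R: the column names (`sample`, `condition`, `batch`), the helper `check_sample_info` and the specific checks are all assumptions made for illustration, so see the README for the real sample info format.

```r
# Hypothetical sample sheet; these column names are assumptions,
# not the actual format expected by DEA.R.
sample_info <- data.frame(
  sample    = paste0("s", 1:6),
  condition = rep(c("control", "treated"), each = 3),
  batch     = rep(c("b1", "b2", "b1"), times = 2),
  stringsAsFactors = TRUE
)

# Hypothetical helper: validates the sheet and returns the design matrix.
check_sample_info <- function(info, condition = "condition",
                              covariates = character()) {
  needed <- c("sample", condition, covariates)
  missing_cols <- setdiff(needed, colnames(info))
  if (length(missing_cols) > 0) {
    stop("Missing column(s): ", paste(missing_cols, collapse = ", "))
  }
  if (anyDuplicated(info$sample) > 0) stop("Sample names must be unique")
  if (any(is.na(info[needed]))) stop("Missing values in sample sheet")

  group_sizes <- table(info[[condition]])
  if (length(group_sizes) < 2) stop("Need at least two conditions")
  if (any(group_sizes < 2)) stop("Each condition needs >= 2 replicates")

  # Covariates first, condition last, so the condition effect
  # ends up as the final coefficient of the model.
  design <- model.matrix(reformulate(c(covariates, condition)), data = info)
  if (qr(design)$rank < ncol(design)) {
    stop("Design is not full rank: covariates confounded with condition")
  }
  design
}

design <- check_sample_info(sample_info, covariates = "batch")
colnames(design)
```

The full-rank check is the one that tends to matter in practice: if a covariate such as batch lines up perfectly with the condition, none of the three packages can separate the two effects.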
Usage
DEA.R <sampleInfoFile> <annotation.gtf>
with:
- sampleInfoFile: a file specifying the samples, with any additional covariates
- annotation.gtf: an annotation file in GTF format matching the reference genome used for alignment
More information can be found in the README on GitHub.
The script performs a few sanity checks on the input data and then runs differential expression analysis with DESeq2, edgeR and limma-voom. The results are written to tab-delimited files, and all potentially useful plots are saved as image files.
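For readers who have not used the three packages side by side, the core of such an analysis can be sketched roughly as follows. This is a simplified illustration under my own assumptions, not the code in DEA.R: it uses simulated counts from DESeq2's `makeExampleDataSet` instead of featureCounts output, a single `condition` factor without covariates, and made-up object and file names. It needs DESeq2, edgeR and limma installed from Bioconductor.

```r
suppressPackageStartupMessages({
  library(DESeq2)
  library(edgeR)   # attaches limma as well
})

# Simulated data standing in for a real count matrix (assumption):
# 500 genes x 6 samples, two conditions "A" and "B".
set.seed(1)
example_dds  <- makeExampleDataSet(n = 500, m = 6, betaSD = 1)
count_matrix <- counts(example_dds)
coldata      <- as.data.frame(colData(example_dds))

# --- DESeq2 ---
dds <- DESeqDataSetFromMatrix(countData = count_matrix, colData = coldata,
                              design = ~ condition)
dds <- DESeq(dds, quiet = TRUE)
res_deseq2 <- as.data.frame(results(dds))

# Design matrix and expression filter shared by edgeR and limma-voom
design <- model.matrix(~ condition, data = coldata)
dge <- DGEList(counts = count_matrix)
keep <- filterByExpr(dge, design)
dge <- dge[keep, , keep.lib.sizes = FALSE]
dge <- calcNormFactors(dge)

# --- edgeR (quasi-likelihood F-test) ---
dge_disp  <- estimateDisp(dge, design)
fit_ql    <- glmQLFit(dge_disp, design)
res_edger <- topTags(glmQLFTest(fit_ql, coef = 2),
                     n = Inf, sort.by = "none")$table

# --- limma-voom ---
v        <- voom(dge, design)
fit_voom <- eBayes(lmFit(v, design))
res_voom <- topTable(fit_voom, coef = 2, number = Inf, sort.by = "none")

# One tab-delimited table per method (file names are illustrative)
out_dir <- tempdir()
write.table(res_deseq2, file.path(out_dir, "deseq2.tsv"),
            sep = "\t", quote = FALSE, col.names = NA)
write.table(res_edger, file.path(out_dir, "edger.tsv"),
            sep = "\t", quote = FALSE, col.names = NA)
write.table(res_voom, file.path(out_dir, "limma_voom.tsv"),
            sep = "\t", quote = FALSE, col.names = NA)
```

Note that the adjusted p-value column is named differently in each table (`padj`, `FDR` and `adj.P.Val`), which is worth keeping in mind when comparing the gene lists across methods.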
Changelog
2017-05-20: major update in code and usage, added readme
2017-05-27: minor update in code and usage
@All: If you have your own implementations that offer similar functionality then include them here. That way we will have a collection in one post.
Can you add an example command line (and some dummy input file/comparison conditions) to this post? People may not realize the utility of the script at the link above by just looking at it.
Oh, right, edited the original post. Thanks!
Needs a README, example usage, and example and test datasets. If it all checks out, think about writing it up in, say:
BMC: Source Code for Biology and Medicine
Thanks for the advice. I definitely agree with the first parts and I'll make sure to add those. But since this is just code put together from manuals and forum posts here and on Stack Overflow, does it make sense to publish that? I'm not sure that I can take credit.
Looks like you could publish it as a 'Workflow' manuscript, though Istvan will be better able to comment on its feasibility.
Way to go. I saw it earlier and was waiting for you to put it here. Great job. I am wondering whether I should also post my automation script for salmon, which gives both count and TPM tables. After that step I usually use a procedural script based on my needs. But DEA.R will definitely be worth a shot for enhancements if I use it for our lab as well. Let me know what you guys think.
Thinking about using this instead of my own script. Are there examples of outputs including tables and figures that are generated?
Not right now, but I could generate those. Note that this isn't actively maintained anymore and there are multiple alternatives including GUIs. But you are most welcome to use the tool :)
Thanks for letting me know this!
Do you happen to know what the alternatives are? I'm looking for a command-line script rather than a GUI, since it takes a long time to process these RNA-seq files.