I am trying to analyze Differential Gene Expression (DGE) in massive count files produced by RNA-seq and `HTSeq-count`. Doing this with `DESeq2` takes a very long time. Also, `R` is not my language of choice.
What else is there to do DGE? I'd prefer the following (sorted by descending preference):
- a native `Python` package (hopefully allowing parallelism)
- an executable/binary tool (maybe something more modern than the old Tuxedo tools)
- something else that does not internally call R packages, e.g. `DESeq2`/`edgeR`. One could call `DESeq2` from `Python` using `rpy2`, but it would be slow.
- something that can employ the power of an Apache Spark cluster (`Python`/`Koalas`?)
- OK, be it `DESeq2`, but maybe parallelize it somehow :-)
Thanks for your help in advance!