Hi I used abundance_estimates_to_matrix.pl from trinity package and generate the gene count matrix for all ~32 samples. and how do I combine into one matrix so that I feed it as a input into edgeR/DESeq2 for differential gene expression analysis. I got struck, help me.
1-8 --> Odor --> Experiment1
9-16 --> Model --> Experiment2
17-24 --> Real --> Experiment3
24--32 --> Control --> Experiment4
gene.counts.matrix
adundance
TRINITY_DN0_c0_g1 12717.86
TRINITY_DN100000_c0_g1 93.65
TRINITY_DN100001_c0_g1 110.58
TRINITY_DN100002_c0_g1 80.76
TRINITY_DN100003_c0_g1 386.84
TRINITY_DN100004_c0_g1 137.85
TRINITY_DN100005_c0_g1 52.14
TRINITY_DN100006_c0_g1 87.19
TRINITY_DN100009_c0_g1 27.88
Well, I'm out of my mind, instead of using complicated r packages. I though it simpler way to do it. I called all the csv's in a directory
How do i merge all the csv in to one using merge function (merge(S1,s2,by="GeneID",all = TRUE) ?
Head Sample1 GeneID ,1_S4 TRINITY_DN0_c0_g1,7491.29 TRINITY_DN100000_c0_g1,45.16 TRINITY_DN100001_c0_g1,95.2 TRINITY_DN100002_c0_g1,47.17 TRINITY_DN100003_c0_g1,94.85
Please don't post comments as answers. Rather, use the
Add Reply
orAdd Comment
options as appropriate.Agreed, I posted as reply. Thanks.
I answered this here: https://support.bioconductor.org/p/132018/#132020 Please do not cross-post.