I am doing the classic pipeline for DGE Analysis. I have quantified some rna-sequences with Salmon and now I have imported them with tximport package (as the vignette says). I am new to the field and I am stuck at this point. On the vignette, before performing DGE with Deseq2 , they create a sampleTable
which must have the same row names as the column names of txi$counts
. I don't know why I have to create this table and how Iit can be useful for the analysis. The matrix with gene/counts (txi table) isn't enough ?
Thank you.
I have healty and not-healty sample. I am very confused sorry.
Basically the variables that you will be using in your model formula should come from the
colData
, which is constructed from thesampleTable
you will create. So, in your case yoursampleTable
will have one column with a name likehealth_status
and two levels within it, either ahealthy
ornon-healthy
and the row names of this table should correspond to the column names of your count table as you also mentioned.So basically I have to costruct the
sampleTable
using thehealth_status
column that I have from my data. And that's it. Thank you for the clarification.