Hey everyone,
I am running an experiment with 4 samples paired-end among 2 conditions (Control vs Mutation) and 2 replicates of each one (C1, C2, MUT1, MUT2).
After mapping with segemehl, I build the transcripts with Cufflinks. So, at the end I have transcripts.gtf
, genes.fpkm_tracking
and isoforms.fpkm_tracking
. Now I have to pick the count (FPKM) of each gene and divide by a certain value corresponding the count of plasmid that was inserted in each sample and then proceed with the pipeline (cuffmerge and cuffdiff).
This values can be found in the table bellow.
Sample Value
C1 445.188/0.296
C2 137.217/0.196
MUT1 340.072/0.143
MUT2 643.493/0.271
But how can I do that? I already tried to edit the output from cufflinks and divide the counts of the 3 files, but when I merge the transcripts, all values disappear. I can't try after runs cuffmerge because the samples are merged and I can't discriminate the samples.
Is there a way to do it?
cufflinks package also outputs estimated raw counts. You could use them to normalise again.
You don't. Cuffdiff is only designed to be used in a few predefined ways, of which what you're trying to do isn't one.
Ok, I will try to use DeSeq2 with the raw counts from cuffdiff, but the values are not integers. Deseq2 can accept this kind of values?
No, you'll either need to round them (not ideal) or instead use either edgeR or limma/voom.
Either that or use something like
htseq_count