Hello everyone!
I believe my questions are quite naive, but I am super new to RNA-seq data analysis, so forgive me! I already saw some questions regarding this subject, but I still do not understand it well. I am currently performing (trying to, actually) an analysis to identify differently expressed genes between two groups (with cancer, no normal samples in any group) from TCGA dataset. I've downloaded TCGA transcriptome data with FPKM-unstranded values, as the data is already normalized (is it right?!). However, when looking for a pipeline to perform this analysis, I noticed that DESEq2 or edgeR/limma analysis only uses raw counts as input to identify these genes. Why I should not be performing DE with FPKM values? I mean, which kind of analysis are FPKM values okay to perform with? Many many thanks in advance.
if you want raw counts for DESeq2, follow this https://divingintogeneticsandgenomics.com/post/how-to-convert-raw-counts-to-tpm-for-tcga-data-and-make-a-heatmap-across-cancer-types/