I followed the following tutorial for RNA-Seq data analysis. https://ycl6.gitbooks.io/rna-seq-data-analysis/
After performing the Differential expression analysis using R (https://ycl6.gitbooks.io/rna-seq-data-analysis/perform_de_analysis.html) the final output files are DE_analysis.gene.txt
and DE_analysis_transcript.txt
. The output file has the following format:
chr left right gid score strand name source biotype logFC logCPM F PValue FDR
1 14403 29570 ENSG00000227232 . - WASH7P havana unprocessed_pseudogene 2.890084925 2.977441296 103.3708945 8.25E-09 3.27E-08
1 89294 133723 ENSG00000238009 . - RP11-34P13.7 ensembl_havana lincRNA 11.51392342 1.605598781 167.1675845 2.10E-10 1.21E-09
Could someone please explain the file format? Especially left
, right
, logFC
, logCPM
, and F
. Which column represents the average gene expression value?
Thank you very much! I apologize for not being clear enough about the question, but your answer actually answered my question.