Do you prefer gene, exons or transcripts or all 3 and which tool do you prefer and why ?
I think transcript would be an ideal unit for summarizing the count data. But I have seen studies that summarize count data in gene level and would like to know if there is any relative benefit in using one or the other.
PS. I wanted to post this as a poll, but unfortunately we don’t have that feature yet biostar - till then lets discuss it in the normal way.
There isn't any single answer to this question. It all depends on what kind of biological question you're trying to answer with the RNAseq data. If I'm looking for differential exon usage due to spliceosome mutations, gene-level data is useless to me. If I'm trying to work with a huge network of genes, I may need to simplify my inputs and use gene-level metrics to make the problem tractable.
I agree with the context of question here. But I was amused by the trend of interpreting/summarizing RNASeq data in the level of genes and not in transcript level. Given that human genes have an average of ~3 transcripts/gene (Ensembl) and the function/biotype ( for example coding, non-coding NMD, intron retention, pseudogene etc http://vega.sanger.ac.uk/info/about/gene_and_transcript_types.html) of each of them varies summarizing in gene level seems to be a backward step. I was wondering if there is any method/study that compared by summarizing data in different levels. IMHO, it in level of genes without considering the transcript/biotype information may bias the results. I am still working on our RNASeq data to show this limitation in a systematic way. I wanted to know if anyone had looked at this more closely in a methodology or a large-scale analysis paper.