Entering edit mode
4.2 years ago
telroyjatter
▴
240
Let's say you have an RNA-seq counts matrix for two experimental groups, and you have already generated your table of differentially expressed genes. Generally speaking, what are the best ways to interrogate this dataset?
The ones I can think of:
- Gene ontology enrichment
- WGCNA and/or MEGENA for co-expression networks: modules; eigengenes, eigengene networks
- ARACNE for transcription factor-target networks
- Key driver analysis (i.e., which hub gene in WGCNA/MEGENA/ARACNE has the highest connectivity; what other ways are there to define key drivers?)
- HOMER for regulatory binding sites
Any other methods, reviews, papers are highly appreciated!
The first thing is "what is your biological question?" Every analysis depends on that response.
Let's say the tissue is from brain, and we want to understand how the administration of a drug affects gene expression.
So you have a list of genes and are looking for biological meaning in this list?
Or in the entire dataset, for example some methods include an entire counts matrix rather than just a list of DEGs or the counts of only DEGs.
Echoing JC here: if you're only thinking of a question after having generated the data, things have been done backwards.
Why is the list of methods you've listed not sufficient to get you started?
It is enough to get started. I don't even have data. I just want to create a list of all of the methods, with their respective applications, that can be used to mine, interrogate, and extract knowledge from an RNA-seq dataset (or *omics datasets more generally).
If my biological question is to understand how a drug affects the transcriptome, are there tools or a particular workflow that would be recommended?