We have a dataset from 4 genotypes, each quadruplicated, and 6 time points.
So that is a LOT of data points. 446=96!
I have never worked with R yet (have some limited Perl / Unix experience though)
How best and from where should I teach myself the rationale for choosing one method of RNA-Seq data analysis (or more), and the practical knowledge for how to implement theory into practice?
SOme of things I want to do are: 1. to know the different ways to normalize (and understand the stat behind how and why to choose one method from them all) 2. to know how to hierarchical cluster into groups that have identical temporal profiles of expression 3. to know how to compare more than just 2 genotypes at a time 4. to know how to compare across time course and not just one time point etc..
Could someone please help me out with how to go about this learning process.
Thanks, TdM
I recomment getting a couple books on R to get started, like statistics with R and bioconductor case studies. The course materials are great, along with the bioconductor mailing list.