Hi Biostar Community,
I'm starting my PhD in a small company and my subject will require to mine data from literature transcriptional analysis (microarray/RNA-seq). I might also have to analyses a couple RNA-seq experiment myself.
I already analyzed microarray data, and I know you don't need a beast to do it, but I have read that for RNAseq, especially on human data, you need considerable computing power if you need to perform raw reads alignment
I don't have access to a computer cluster, and it is uncertain that I will.
I seek for advices about what solution are available for me ? Would a laptop be sufficient to perform re-analysis of published data ? Are published data mostly raw datas for RNA-seq ? Do I need to look at Desktop solution ? Or is it not possible without an external computer cluster ?
Thanks for your help !
Thank you both for your answer !
I do have an affiliation with a research institute linked with an hospital, but the access to a bioinformatic server from outside the institute still have to be discussed. You confirm my fear that most public accessible data are raw data.
I used galaxy before with Chip-seq data, but I had access to an institutional galaxy plateform. Now I would only have access to the open access, meaning that have a 250 GB of data limitation.
Another limitation might be network connectivity : Network is pretty bad here, may be upgraded, but for now, exchanging GBs of data is out of question...
What you said WouterDeCoster about original Pipeline is interesting. I don't see any reason why I should follow the paper pipeline if there is new/better available approach.
What about Bioconductor ? Is RNAseq analysis with R possible ? Does it make sense in term of computing power ?
Alright, then you can try the lighter methods. Those are definitely good, but it good be that for your assignment you needed to exactly replicate the analysis, so that's why I asked.
Typically you first need to do the (pseudo)alignment, followed by read counting (which can be R) followed by differential expression analysis (which is in R). So yes, you can do a lot in R/Bioconductor.