Dear All,
What all analysis we can do after the Denovo assembly for Transcriptome sequencing data of a new species. As the transcriptome data is not published before. I can mention some such as..
- Do a Pathway analysis using KEGG.
- Do a GO Annotation.
- Try OrthoVenn to plot graphs for protein orthologs.
- Try a BGI-WEGO plot.
This might be a broad question. But your inputs can help lot of researchers who are new to the Genomics and can give some ideas. Even a small input from you can be helpful.
Thank you.
Regards,
Amk
Any analyses that seek answers to the research questions that were set prior to sequencing. Fishing expeditions are rather pointless IMO.
Hi,
Sorry I have edited question. As it is a assembly to draft transcriptome for the first time. Thank you.
Amk
Perhaps you're misusing the term "fishing expedition" which is known as multiple testing. This is where you perform many different statistical tests and then choose the best one "If you torture the data long enough, it will confess to anything" - R. Coase. I'm sure we can all agree this is a big NO NO!
However, I have seen the term "fishing expedition" used to describe something quite different from multiple testing. Which is finding hidden internal structure of high-dimension data, generating hypothesis for future studies, data visualization, data mining (combining genomic data with publicly available dataset's like patient medical records) etc. I can't really understand why you would be against this? The viewpoint seems quite antiquated. For example, here is an excerpt from the book Philosophy of Complex Systems (which I won't be reading because I'm not willing to spend 198 euros on it)
[IMG]http://i65.tinypic.com/k1ym1s.jpg[/IMG]
IMO "Fishing expeditions" used in the above context HAVE made way for unexpected, ground-breaking discoveries in biomedical research. Much of what we know about how cells function has come from unbiased screening approaches. For example, untargeted metabolomics studies are hypothesis-generating by design and are the most exciting in terms of discovering biomarkers or elucidating metabolic profiles. Along a similar line of thinking RNA-seq is superior to qPCR where you only look at a small panel of genes of interest - you may miss gene's you weren't even thinking about a priori. Also, in single-cell RNA-seq when looking at a highly hetergenous cell population in tissue/tumor we are still finding new cell types (and subtypes) through the use of unsupervised machine learning techniques (e.g. by clustering data with PCA, tSNE, etc.) or tracking how cell's proceed through development using imputation techniques (e.g. MAGIC). There's my two-cent's FWIW
http://www.dictionary.com/browse/fishing-expedition
I fail to see how posting a link to dictionary.com counts as a constructive conversation. Could you please elaborate how 1) Doing a Pathway analysis using KEGG 2) Doing a GO Annotation qualifies as NO "clearly defined plan or purpose in the hope of discovering useful information"?