Hi!
Okay this is an open question for getting some ideas.
Imagine if you have a ChIP sequencing data on some disease say Type 2 diabetes (strictly for the sake of example) and you have this data from 3 different labs, which means three different sets of data on 3 different cell types.
(This ChIP sequencing data is on binding of a transcription factor X in a diseased state).
As an bioinformatician what you will first aim to get out of this data, I mean in terms of your goal(s).
Offcourse you will look for the regions which are conserved across the cell lines and the regions which are not unique.
But what will be your generic plan to get the most out from a dataset like this.
All suggestions are welcome.
Thank you
If somebody came to me with a dataset, or 3 datasets, on which they had performed ChIP (so they did not only collect DNA but also knew what antibody to order so what transcription factor they were interested in) without a clear goal... I would first of all doubt their sanity. They must have an idea what they want to know before they start those experiments. Maybe I get your question all wrong, but for me it doesn't make sense. There must already be a goal.
Hi Chris, The experiments were performed in three diffferent labs with three different goals. This resulted in genome wide DNA binding data for same TF on three different cell lines. Now, if such a data is publicly available, what would you do with it?
I Agree with Chris, it is still insane. Think about the questions FIRST, then look for the publicly available data that might answer these questions. Not the other way around. If you don't have ideas, read. What is your expertise? Biology? Computer science? HOwever, it is unlikely that you think about a truly interesting question out of nowhere. Find a project that already has clear questions first. If you are lucky interesting questions will arise. And then you might want to use already available data to answer them.