Hello everybody,
I'm a PhD student and I'm working with a viral transcription factor. My task is to find out where the transcription factor binds in the human genome and what it does there. The idea was that it would bind near promoters and activate human genes, since this is its job in the viral genome.
What I can see after ChIP-Seq is, that it seems to bind more or less everywhere. There are, according to MACS2 standard settings, more than 300.000 peaks, which are also visible if you have a look in IGV (See picture at the end of the post).
When I analyse the peaks with MEME/ DREME, I get binding motifs in up to 90 % of the peaks, which fits to the binding sites in the virus (the binding of the transcription factor at these sites was confirmed via EMSAs (electro mobility shift assays)).
By checking in IGV I can neither see an exclusive association with transcriptional start sites, exons or something else.
The RNA-Seq revealed that upon expression of the viral transcription factor cellular genes are upregulated (this is also its job in the viral genome) as well as downregulated (new feature). After 6 h the strongest upregulation is about 10 fold, the strongest downregulation to 5 % of the original expression. There are only about 10 genes strongly up- and 10 genes strongly downregulated, which was kind of strange, since the complete cells is reprogrammed.
My question would now be, if anybody knows if there are more transcription factors with about 300.000 binding sites and if people just pick the ones which fit to their concepts the best? And maybe anybody has an idea what this transcription factor could do with that many binding sites? I have some ideas but don't want to write them right now, not to push the discussion in a special direction. I'd be happy for any input!
By the way, maybe anybody knows how I can enter refSeqGenes to IGV and see the gene names instead of the NM numbers?
Thanks a lot, Alex
About the picture:
- 1st row: ChIP input
- 2nd row: ChIP transcription factor
- 3th row: MACS2 peaks
- 4rd row: RNA-Seq 0 h
- 5th row: RNA-Seq 3 h
- 6th row: RNA-Seq 6 h
- 7th row: RefSeqGenes
- 8th row: knownGeneTSS (transcriptional start sites)
Can you share what virus this is?
Hello,
it is Epstein-Barr-Virus
Hello ChIP-Tease!
It appears that your post has been cross-posted to another site: http://seqanswers.com/forums/showthread.php?t=60948
This is typically not recommended as it runs the risk of annoying people in both communities.
Hey Pierre,
thanks a lot for the hint. I thought it might help since these are independent forums. But for sure you are right. A lot of people will be in both forums. I will add the link in the other forum to the top of the post! Sorry, I'm completely new in forums. Have a nice day!
You said "expressed the TF", this would be in transfected cells, or were they infected with virus? Do you have a control in your RNA-Seq experiments (e.g. transfected for a non-TF EBV protein)? It would be important to know if the DEGs you've found are in response to the TF specifically and not in response to transfection.
It looks like the TF is binding the genome indiscriminately and may be capable of exerting some regulatory effect on the cells. Do you have peaks in any regions corresponding to the DEGs you've identified? You could validate the binding of the TF to those genes with EMSA.
I guess the final check would be to see if these effects hold up in EBV infected cells.