Hi..
I'm new to ChIPseq analysis, I performed all step from QC to peak call using MACS2. I have MACS2 peaks file (narrow and broad peak file) for two different histone marks which are h3k27me3 and h3k4me3. I want to find the promoters around transcription start site (TSS plus and minus 1000bp) that have these histone marks in two different group. Can anybody help me with that?? What are the tools that I can use to do this.
Thanks,,
Hi Prakash.. Can I use the genome annotation file from genecode rather than ensemble for biomart to get the TSS regions??
GENCODE annotation is Ensembl annotation.
Hi, I agree that essentially GENCODE and Ensembl annotations are same but GENCODE has annotation from both HAVANA and Ensembl. As Munaj86 had this query that he/she wanted to access GENCODE from biomaRt, that's why I replied in that context.
Thanks
What is my username? What does that tell you?
The Havana annotators sit at the other end of the corridor to me. The Ensembl automatic annotators sit halfway down the same corridor. Every month, money from the Ensembl grant goes into my bank account. I know what I'm talking about.
munaj86: Prakash does not know what they are talking about. Do not listen to them.
The annotation presented in Ensembl is GENCODE. GENCODE is a brand name used to describe the annotation from Ensembl for human and mouse. This consists of the merged data from the Ensembl automatic pipeline and the Havana manual annotation. There is nowhere in the world that you can get hold of the automatic annotation alone or the manual annotation alone. You can only get hold of the two of them merged together and you can call it either the Ensembl annotation or the GENCODE annotation – it is the same.
Thanks Emily for clarification ! I have edited my comment.
yes you can use that as well
I did try to look at mart function and list the available human annotation file and it is only display ensemble, how can I use the gencode annotation file using usemart() function. I already have a gtf file of human genome file from gencode I just don't know how to use in mart() function.. Any suggestion??
As Emily clarified about GENCODE and Ensembl , you could use ensembl to get chromosome location in bed format and resize it to plus minus 1000bp.
This is wrong. GENCODE == Ensembl.
Thanks a lot for the code, @Prakash! Really helpful!!!