Hi, I need some help! I want to select my target genes generated by my ChIP-seq (HOMER - genes symbols) within my list of differentially expressed genes from RNA-seq (Cuffdiff output), so I can identify my target genes ids, “XLOC_” gene ids generated by cufflinks, (like select a complete line from my cuffdiff file where I find my target gene), this way I can plot the heatmap of these genes using RNA-seq data. How can I do this in a simple way?
->Annotated genes (HOMER output)
- SULT1B1
- LHFPL
- ZMYND8
- RAD23B ...
->Cuffdiff table file (RNA-seq)
- gene_id gene symbol locus
- XLOC_000001 DDX11L1 chr1:11868-31109
- XLOC_000002 MIR1302-2 chr1:11868-31109
- XLOC_000003 OR4G4P chr1:52472-53312 ...
Hello LuisNagano!
It appears that your post has been cross-posted to another site: http://seqanswers.com/forums/showthread.php?t=74116
This is typically not recommended as it runs the risk of annoying people in both communities.
What do you mean with "in a simple way"? Does it mean that you know how to use R for instance?
With R you can do this simple namely, your Homer output seem to be gene symbols, which I recognize in the second column of your cuffdiff table.
So import the files in R and use for example:
%in%