Dear All,
I have file which looks like the following
chromosome start end peak.location chip.value target.gene.name distance.to.gene
chr1 162990333 162990703 162990519 33 RP11-331H2.3.1 136
Is there a way to define if this peak is in exon, intron, promoter or 3' UTR?
Do you know a way for that? Could you please give some quidelines?
thank you in advance
best regards Lena
Do you have a gene model file (GFF) for the species/organism you're working on? Then, its straightforward to extract the corresponding intervals for your peak from the GFF and obtain the
type
.no i don't have anything....how can i do that?
moreover isn't any other way to fix my problem? Any package in R?
GFF files are usually available from the same website where you download your reference. What are you working on? You can read more about GFF format here: http://www.sanger.ac.uk/resources/software/gff/spec.html
I haven't worked on ChIP data. So, I can't tell if the format is supposed to have the type annotated. Pablo's solution might be straightforward if you aren't working on plants I suppose.