How much overlap to consider when annotating a region
10.0 years ago
Saad Khan ▴ 440

Hi, I have ChIP and DNA methylation data in 300 bp windows. I am trying to annotate these windows as falling into different kinds of regions, e.g. intergenic, exon, intron, UTR, promoter. I have already extracted these regions from a GTF file with bedtools, using the method described here. Now I want to assign each window to one of these regions. I am confused about a few things and would be glad if someone could help me out.

  1. What percentage of overlap should I require in order to assign each tile unambiguously to a single annotation?
  2. If I don't use an overlap threshold and simply assign each window to one kind of annotation, which parameters are best suited for doing that?
bed annotation • 2.2k views
9.8 years ago
PoGibas 5.1k

Simple answer would be: "Are you ready to try some R?!" :-)

See my answer in "Overlap between 2 sets of genomic regions of differing size": use GenometriCorr. It will produce reliable results and nice visualisations.
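For the overlap-threshold part of the question, one common approach is to assign each window to the annotation that covers the largest fraction of it, and to flag the window as ambiguous when even the best annotation falls below a cutoff. Below is a minimal Python sketch of that idea, not the GenometriCorr method. The function names, the 0.5 default cutoff, the "ambiguous" label and the tie-break order in `PRIORITY` are all assumptions chosen for illustration; coordinates are treated as half-open, BED-style intervals on a single chromosome. On the command line, `bedtools intersect` with `-f` (minimum overlap as a fraction of A) and `-wo` (report overlapping bases) could be used to get the same per-window overlaps.

```python
from collections import defaultdict

# Hypothetical tie-break order (most specific first); not taken from the thread.
PRIORITY = ["promoter", "utr", "exon", "intron", "intergenic"]


def covered_bases(intervals):
    """Return the total length of the union of half-open (start, end) intervals."""
    total = 0
    current_end = None
    for start, end in sorted(intervals):
        if current_end is None or start > current_end:
            # Disjoint from everything seen so far: count it in full.
            total += end - start
            current_end = end
        elif end > current_end:
            # Partly overlaps the previous run: count only the new bases.
            total += end - current_end
            current_end = end
    return total


def annotate_window(window, features, min_fraction=0.5):
    """Assign one window to the annotation covering most of it.

    window       : (start, end), half-open
    features     : iterable of (start, end, label) on the same chromosome
    min_fraction : assumed cutoff; below it the window is called "ambiguous"
    Returns (label, fraction_of_window_covered_by_that_label).
    """
    w_start, w_end = window
    clipped = defaultdict(list)
    for f_start, f_end, label in features:
        lo, hi = max(w_start, f_start), min(w_end, f_end)
        if lo < hi:  # keep only the part of the feature inside the window
            clipped[label].append((lo, hi))
    if not clipped:
        return "ambiguous", 0.0

    width = w_end - w_start
    fractions = {lab: covered_bases(iv) / width for lab, iv in clipped.items()}

    def rank(lab):
        return PRIORITY.index(lab) if lab in PRIORITY else len(PRIORITY)

    # Largest covered fraction wins; ties go to the earlier PRIORITY entry.
    best = max(fractions, key=lambda lab: (fractions[lab], -rank(lab)))
    if fractions[best] < min_fraction:
        return "ambiguous", fractions[best]
    return best, fractions[best]


if __name__ == "__main__":
    feats = [(900, 1100, "promoter"), (1100, 1400, "exon")]
    print(annotate_window((1000, 1300), feats))
    print(annotate_window((1000, 1300), feats, min_fraction=0.8))
```

There is no universally correct cutoff. A stricter `min_fraction` gives cleaner categories at the cost of more windows left ambiguous, so it is worth checking how many windows change label as the cutoff moves.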
