Repeatmasker Annotation Descriptions
0
0
Entering edit mode
7.9 years ago
morovatunc ▴ 560

Hi,

I have obtained Repeatmasker bed file for hg19 assembly via UCSC table browser. But I have seen that they are quite common across genome. Is there any wiki or something that I can get the information about th meaning of that annotation.

For example; I could only get this link related with this info. But it is not very informative. —> www.repeatmasker.org/cgi-bin/ViewRepeat?id=LTR40a —> LTR40a

Let this graduate student have a happy new years eve.

Best regards,

Tunc.

repeatmasker UCSC • 2.3k views
ADD COMMENT
0
Entering edit mode

What kind of information are you looking for?

As far as I know, the repeatmasker database is the definitive resource on repeats. If you want more detail, you would probably need to dig into the literature. But characterising repeats has not been popular for many years so information tends to be pretty thin.

ADD REPLY
0
Entering edit mode

I am working on an effect of transcriptional factor binding on transcriptional activity. I have a bunch of regions where I see this TF binding. But I would like to filter out the binding regions where for example bind on a highly repetitive region. therefore, I wanted to ask in my question that is there a specific annotation tag in repeat masker database which I assume those binding events are trash.

Thank you for your answer

ADD REPLY
1
Entering edit mode

In that case, I would filter out all annotated repeats without worrying about specific tags.

LINEs, SINEs and LTRs are most common in the assembled genome but the DNA/RNA/Simple repeats are also problematic in ChIP-seq (mapping issues etc.)

Having said that, I would also set a limit on the degree of overlap between your TF and a given repeat before tossing it out. It's best to eyeball a few regions to set this definition but something around 50% might work. For example, if your peaks are 300 bp, only exclude if >150 bp overlaps a repeat. Otherwise you might throw away a lot of peaks where the repeat overlap is incidental.

Or you could do the inverse and only select for peaks that overlap a gene.

ADD REPLY
0
Entering edit mode

Thank you very much for your extended answer. It will really help me a lot!

Best regards,

Tunc.

ADD REPLY

Login before adding your answer.

Traffic: 2524 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6