Why Ucsc Refseq Gene Position Overlap Gap?
1
0
Entering edit mode
11.5 years ago
gqyi20 • 0

Hello! everyone.

Recently, I downloaded chicken "refseq gene" file and "gap" file in UCSC, but I found some gaps overlapping refseq genes, even locating in genes, so what happend in this situation and why? Wheather we can discard those genes overlapping gaps or not? I can't search similar question about this, so I need your help. Thanks a lot!

A example is showed as follows:

refseq gene
#bin   name               chrom      strand    txStart txEnd
74     NM_001199320        chr1           -    1371140    1821501
gaps
chrom     chromStart    chromEnd     size
chr1      1581252       1614100     32848
chr1      1615830       1637787     21957
ucsc refseq gene • 2.4k views
ADD COMMENT
0
Entering edit mode

I don't know about the chicken sequence, but, theoretically, "gap" can be anywhere indicating that the underlying sequence is not known for some reasons and should be filled with Ns. Edit: I think you can check this by looking at FASTA at the gap's positions.

ADD REPLY
0
Entering edit mode

you can also align the sequences from refseq to the chicken genome to see the alignment.

ADD REPLY
0
Entering edit mode
11.5 years ago
deanna.church ★ 1.1k

The chicken assembly is a whole genome shotgun assembly and thus contains many gaps. In this view from the NCBI alignment (which may differ from the UCSC alignment): http://tinyurl.com/n2onshb

The second track is the 'Tiling Path' track and shows the order of WGS contigs and the gaps between them in this assembly.

ADD COMMENT

Login before adding your answer.

Traffic: 2701 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6