I want to find genomic regions conserved between species. Therefore I downloaded the mm9rn4 net file from UCSC, as I was adviced by my professor. However, I do not understand how to interpret the net format (The UCSC description of the file format is located here: http://genome.ucsc.edu/goldenPath/help/net.html) A few questions follow:
1) There are two types of "Classes": gap and fill. Does fill mean an overlapping region in the alignment?
2) The fill regions are of different lengths. I.e.
fill 3000305 14924932 chr5 - 106694 16612310
Means that the region in mouse is 14924932 long and the region in rats is 16612310 long.
So even though I know that region x in mm9 "equals" region y in rn4, these regions are of different length. Therefore I cannot know that the subregion that begins 5000 basepairs after the start of region x "equals" the subregion that begins 5000 nuclotides after the start of region y or what?
3) One field describes the "relative orientation" between target and query species. It is either + or -. What does this mean? How should a plus sign be interpreted?
Thanks for the patience.
Edit: I see the format is nested, so the above descriptions/thoughts are probably way off.
See A: Translating genomic locations between species for more info