5.7 years ago
Ekalavya
The rmsk primary table has more than 5 million records for repetitive elements. Of these, 989434 entries belong to the "LINE" repeat class. When I checked the lengths of these LINEs, the majority (973906) were shorter than 5 kb. I used "genoStart" and "genoEnd" to calculate the lengths. Aren't these supposed to be long interspersed nuclear elements?
How many LINEs are present in the mouse genome? What is their length distribution? How can I get the genome coordinates (mm10) of LINEs?
Please help. Thanks.
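The file you downloaded already has this information: keep the rows whose repClass is "LINE" and use genoName, genoStart, and genoEnd as the coordinates. Note that most LINE copies in the genome are truncated rather than full length, which is why you're seeing so many below the canonical length.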
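A minimal sketch of how the count, the coordinates, and the length distribution could be pulled from the table. It assumes the tab-separated UCSC rmsk dump with the usual 17-column layout (check it against the rmsk.sql schema for your download); the function names and the 1 kb bin size are illustrative choices, not part of any UCSC tool.

```python
from collections import Counter

# Assumed column order of the UCSC rmsk table dump (verify against rmsk.sql).
RMSK_COLUMNS = [
    "bin", "swScore", "milliDiv", "milliDel", "milliIns",
    "genoName", "genoStart", "genoEnd", "genoLeft", "strand",
    "repName", "repClass", "repFamily", "repStart", "repEnd",
    "repLeft", "id",
]


def line_elements(rows, repeat_class="LINE"):
    """Yield (chrom, start, end, repName, repFamily, strand, length)
    for every row whose repClass matches repeat_class."""
    for row in rows:
        fields = row.rstrip("\n").split("\t")
        if len(fields) != len(RMSK_COLUMNS):
            continue  # skip blank or malformed lines
        rec = dict(zip(RMSK_COLUMNS, fields))
        if rec["repClass"] != repeat_class:
            continue
        start, end = int(rec["genoStart"]), int(rec["genoEnd"])
        # genoStart is 0-based and genoEnd is exclusive, so end - start
        # is the annotated length in bases.
        yield (rec["genoName"], start, end, rec["repName"],
               rec["repFamily"], rec["strand"], end - start)


def length_histogram(elements, bin_size=1000):
    """Count elements per length bin; keys are the lower edge of each bin."""
    counts = Counter()
    for element in elements:
        length = element[-1]
        counts[(length // bin_size) * bin_size] += 1
    return dict(sorted(counts.items()))


# Example use (the file name is hypothetical):
#   import gzip
#   with gzip.open("rmsk.txt.gz", "rt") as handle:
#       lines = list(line_elements(handle))
#   print(len(lines))               # number of LINE annotations
#   print(length_histogram(lines))  # counts per 1 kb length bin
```

The first three fields of each tuple are already BED-style coordinates (0-based start, exclusive end), so they can be written out directly as chrom, start, end. Keep in mind that RepeatMasker often reports one insertion as several fragments, so the number of rows is likely higher than the number of distinct insertion events.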