I have transposons data for mouse organised in the following manner (txt)
chr start end hit
1 111 130 geneXXX
2 1546 1867 intergenic
3 123 234 geneYYY
this is for the assembly 38, Ensembl release 73.
I want to compare it now with the release 90 of Ensembl (which is still assembly 38 of mouse) to confirm if the hits of the transposon are still accurate with most updated version. (e.g. to know if the transposon on chr 2 is still hitting on an intergenic region or if that region is now attributed to a gene. Or if the region in chr 3 where my transposon hit is still annotated as geneYYY and so on).
Are there any (semi) automated tools to do it? Which would be the best approach?
Thank you !
I would suggest just re-annotating your data from scratch (starting with the first three columns), assuming that you have code to do the annotation in the first place. The changes in the Ensembl genes will not be captured by "coordinate conversion", so do not spend any time looking at tools that do that. Instead, look for approaches to annotate genomic regions from a gtf/gff file.