I have about ~50,000 gene models from an unannotated species, that I am looking to map back to a reference genome of a closely related one. The issue I keep running into when I am BLASTing the gene models to the reference genome is that multiple gene models are going to the same gene. I want to use some type of mapper (bowtie, gmap, ect), but that can potentially address this redundancy issue. Perhaps, if 3 gene models map to the same gene, only return the closest related match. Or, is there a way to see if these gene models are overlapping at all? Just trying to figure out the best course of action.
Cheers!