Entering edit mode
5.5 years ago
jstevenson2256
▴
10
I have two genome assemblies from two phenotypes of a non-model ant species. I have performed gene prediction on them both and have found a difference in the number of predicted genes. How do I find these differing genes? I can't compare gene IDs as they will be different in both assemblies and I can't pull their positions and compare them as the same gene has different coordinates in both assemblies simply due to the assemblies being different.
Any ideas on how I can compare these putative genes between the two genomes?
Maybe blast can help?
For non coding genes, UTRs etc it would also be good to generate a chain file for liftOver (provided only assemblies are different and species are same/very_similar) . I have used this pipe line https://github.com/wurmlab/flo with good results to generate chain files which then can be used by crossmap/liftOver to convert coordinates.
I also think generating your own chain files and use liftOver is a good shoot. Otherwise, extract the genomic sequence of each gene and then use blat or gmap to map the to the other assembly.