Hi, I assembled a genome using long and short-reads and then used GATK for genotyping. Once, I read that one paper assessed genome quality by remapping the reads to the reference, and when I checked the coverage, it was less than 99%, missing like 1.3% which doesn't seem a lot, but in terms of length it is.
I wanted to do two things. First, I wanted to get the coordinates of the unmapped regions and then get only the unmapped regions of the fasta reference, and then check again the BUSCO SCORE. In a similar context, I have the annotation gff3 file and I want to find how many genes are located in the unmapped regions.
Are there any programs that can help me do these tasks?
Thank you very much;