I am creating the index reference in order to align my fastq reads. One question I have is should I include all the "funny" chromosomes like the chr6hapcox or chrUnxxxxxxx?
I will be using GATK downstream for other parts of the analysis, and I know it doesn't like when the chromosomes are out of order, so what should I do as regards these non-canonical chromosomes which don't have (at least to me) an implicit ordering?
I realise this, but the problem is it will cause GATK to throw an error when parsing the BAM files. Any ideas on how to get GATK to not break?
you can map with your favorite tool and filter the funny hits from the SAM/BAM file after that.