I've noticed poor overlap between number of differentially expressed genes between CLC's 'Empirical Analysis of DGE' (edgeR test) versus Cuffdiff. CLC maps 20% more reads on average, and finds 200-400% more differentially expressed genes. It is to my understanding that CLC uses a proprietary read mapper that will usually give a higher percentage, while Tophat's probabilistic model spends more time resolving overlapping ends to reduce random placement. However, it is less obvious how to relax stringency in Tophat's parameters to account for the high amount of variation in the hybrids we sequenced versus our reference genome. Any insight would be greatly appreciated.