Using spike-in controls is a common way of evaluating statistical methods while finding differentially expressed genes. Having Fastq files containing ERCC controls and the corresponding gtf file for the ERCCs, how can one does the alignment step with TopHat? For instance if the samples are from human, we have the fastq file, hg19 reference and the ERCC.gtf file. How can one use TopHat to align the fatsq files to the reference genome while they include the ERCC reads? Should we combine the hg19 reference genome with the ERCC. gtf file? Following article can be an example of this situation:
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3166838/
How should we include the ERCC controls information to the reference genome used in Tophat?
Thanks for the help.