I got lots of SNV and Indel somatic mutations from whole genome sequencing data from tumor. And I want to annotate these somatic mutations with whether located in the homopolymer region of human genome. I have already search the ucsc repeatmasker track, and it does not contain homopolymer annotation. I want to know where to download this annotation or how can I calculate the homopolymer of human genome? Thanks !
Thank you very much, I will try this method.
@Pierre Lindenbaum, -A HomopolymerRun isn't mentioned in the documentation on the GATK site or when you run "gatk VariantAnnotator --help" on the command line. Do you know what happened to this option or of another method to find all homopolymer regions in the human genome?