I noticed that in IGV there are lower case regions (soft-masked). I want to exclude these in an analysis and was wondering if there was an annotation for the sites that are soft-masked. Like a bed file or GTF with all the specific soft-masked locations.
I could go through and identify the lower case regions manually but before I do that maybe I'm overlooking an available data source?
I'm looking at human data (hg38) but this is probably not super relevant to the question
A recent past thread about this : Repeat masked gtf files from ensembl
I don't think GTF files with masked locations are available.