I have been wondering why so many groups around the world are re-processing the same subsets of TCGA data, at least for simple analyses such as RNA-seq. Surely these analyses have already been done with a reasonable set of parameters and programs, and the result files are potentially publicly available.
These scenarios are specialized circumstances and certainly apply in some cases, but I find it hard to believe that everyone using TCGA data has a real need to download and re-process it.
I wonder how many users do it because they cannot find processed data, or do not know that they can use results from one of the portals that provide access to derived data.