Hi
I am trying to create heatmaps. I have used heatmap.2, pheatmap and clustvis. I have dataset of 2000 rows containing allele frquency of different SNPs in different populations. I cannot filter rows as I am visualizing relationship between different populations on the basis of allele frequencies. The problems with R packages is that they does not create good resolution heatmaps. And clustvis plots heatmaps on the basis of principal components not on the basis of actual allele freqiencies.
Can anyone please tell me how to great good resolution heatmaps of such large datasets?
What do you mean by good resolution ? Typically, plotting a large data set such that all details are visible to the naked eye means making the plot bigger. If you want labels to be readable they should be in a font at least 7-8 pt (~10 pixels) which means this is also the minimum height of a heatmap cell so 2000 rows x 10 = 20000 pixels so for a typical printer resolution of 72 dpi, that's a paper height of 278 inches or about 7 meters. Which is why dimensionality reduction techniques are useful for visualization. Clustering is one possible approach but you may also not have to use heatmaps depending on what you're trying to show.
you could run k-means and then create k different heatmaps, one for each cluster. I don't think you'll get over the poor resolution with so many datapoints. For interactive work you could generate a zoomable heatmap in an app.