Entering edit mode
9.3 years ago
areyoujokingme
▴
70
I just realized I have NEVER seen GWAS data. I talk about it all the time and study results from many GWAS studies, but I don't know what the data matrix looks like for this. Can someone give me some idea of the structure of GWAS data? All datasets I am aware of are protected/private. I don't need a dataset - just want to be able to describe the data to someone in the elevator, etc.
Thanks for your thoughts!
Do you want to see how the genotype data is represented or do you want to "see" the whole analysis pipeline?
I want to understand and be able to describe what the raw data looks like. The whole analysis pipeline wouldn't hurt me, honestly (haha)! Do you have any ideas?
If you only want to know how the raw data "looks" like, you can either download the hapmap samples (plink format) or you can look into this post: Publicly Available, Individual-Level, Gwas Data? those resources might provide you a starting point.
However, I am uncertain how the data looks like before the plink kicks in. Hopefully someone else can show you those