Hi, I am writing a webapp to help people rapidly search through exome variants. I would like some publicly available test data sets for examples/tutorials.
I know people have previously asked for something similar:
- How To Obtain At Least 30 Realistlic Example Vcf Files?
- Where Can I Download Vcf Files For Publicly Available Data?
However I would specifically like exomes that are NOT healthy, ie have confirmed diagnoses of diseases, that do not have any restrictions on them. For instance something that requires written approval is out, as I want to put it on my public site.
Ideally I'd like:
- Famous / Well known disorders
- Different types, eg de-novo and a pedigree with Mendelian disorder
As I don't think you will find these datasets, could you just create make examples that show of your webapps functionality.
For instance, using the 1000 genomes exomes and randomly assigning people into case / control groups then displaying that data. This would, I think, be powerful enough, to show the apps functionality for any complex traits, but would be limited for the mendelian traits you mention.