Hi, 👋
I was looking through Biostars to find some good resources to get started. Unfortunately, most of the posts are really old 4+ years and many links are dead.
I was hoping to make a small collection of good explanatory reads for learning more about the QC part of GWAS. If someone has some tips, tricks, protocols, notebooks or other useful resources please share by posting.
Thanks
I can add this which I liked: A tutorial on conducting genome‐wide association studies: Quality control and statistical analysis https://doi.org/10.1002/mpr.1608 It has a git with all the files to follow and replicate the files and process..
What is your starting point? Do you have the genotypes or you should establish that first?
Thanks for replying. Right now I just have raw genotype data. I am starting from scratch.
Although QTL rather than GWAS but QC should be the same: https://kbroman.org/qtl2/assets/vignettes/do_diagnostics.html In addition to avoid bad markers we now interpolate evenly dispersed positions in the genome instead of using the actual markers.
Thanks for the advice - I will definitely look into that
Please add the links to your post.