Hello!
I'm starting gwas analysis and, actually I have quite a hard time on some points.The analysis were conduted by Affymetrix array and I have a case-control study. I have the .bed, .fam. bin, .ped, and .map files. However the files do not state which are the individuals cases and controls and also the sex is not included. I also need to remove some samples from the analysis.How can I edit these files, and which ones should I edit (ped and fam?) I've tried excel and I can not because the file is too large. The other question how can I change the Affymetrix codes to the rs name? Can the bed file be read in any way? Sorry if my questions are very basic, but really I'm a beginner in this area. Thank you!
maybe slightly off topic but if you are new to GWAS and Affy you may wish to take note of this http://www.affymetrix.com/support/help/faqs/dna_ge_arrays/faq_32.jsp
Many thanks all, your tips were useful, wonderful!! Now, I have other question about this error:
A problem with line 124 in [ C:\Users\leka_\Documents\DOUTORADO\gwas\gwas\Dataset\gwas_26032019\plink_text_format\JSB_AX001toAX011_final.ped ] Expecting 6 + 2 * 920636 = 1841278 columns, but found 1455707
When I'm try to remove this individual( line 124). I can not! How I can to solve this problem? Because any analysis running not is working.
Many thanks!!
Your PED file is essentially not in agreement with [most likely] your MAP file.
The PED file has 6 columns for:
Then, the remaining columns are genotype values, with 2 columns per genotype (so, 2 * the number of genotypes)