There is a data set that I want to work with (ftp://ftp.cephb.fr/hgdp_supp10/Harvard_HGDP-CEPH/) to do some PCA analyses (see New to SNP data: I need help getting Affy Axiom data out of genotyping console into a Principle Component Analysis)
Anyways, I've downloaded the data set, but I have come to learn that the annotation file (ftp://ftp.cephb.fr/hgdp_supp10/Harvard_HGDP-CEPH/annotation.txt) and the map/ped files (ftp://ftp.cephb.fr/hgdp_supp10/Harvard_HGDP-CEPH/all_snp.ped.gz, ftp://ftp.cephb.fr/hgdp_supp10/Harvard_HGDP-CEPH/all_snp.map.gz) I need refer to the Affy SNP ids, not the dbSNP rs ids.
How can I convert the ids en masse to dbSNP rs ids?
Can anyone suggest a script that could help me do this? Thanks!
I have the same problem, anyone have a script to help us? Please. Thanks!
I got the answer, check the link: https://stackoverflow.com/questions/7846476/replace-column-in-one-file-with-column-from-another-using-awk