Hi there - I'm new to plink and genomics more generally. I am trying to generate a .map file with cM values using phased VCFs in PLINK (v1.90b6.21), but am having no luck getting the cM values despite using phased data. I need cM values for use with hap-ibd.
My vcfs were phased first with beagle and then shapeit, and look like this:
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT AI1 AI12 AI13 AI14 AI15 AI16 AI17 AI18 AI19 AI2 AI20 AI21 AI22 AI3 AI4 AI5 AI6 AI7 AI8 AI9
chr10 5246 . T A . PASS . GT 0|0 0|0 0|1 0|0 0|0 0|1 0|0 0|0 0|0 0|0 0|0 0|1 0|1 0|0 0|0 0|0 0|1 0|1 0|1 0|1
chr10 5250 . G A . PASS . GT 0|0 0|0 0|1 0|0 0|0 0|1 0|0 0|0 0|0 0|0 0|0 0|1 0|1 0|0 0|0 0|0 0|1 0|1 0|1 0|1
chr10 7339 . A G . PASS . GT 0|0 0|0 0|0 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 1|1 0|0 1|1 1|1
chr10 9662 . C T . PASS . GT 0|0 0|0 0|0 0|0 0|0 0|1 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0
chr10 12614 . G A . PASS . GT 0|0 0|0 0|0 0|0 0|0 1|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0 0|0
To generate the map files, I've tried a dozen variations of:
for POP in X Y Z
do
plink --vcf $VCFDIR/SNPs_phased_chromosomes_dataset1_${POP}.recode.vcf --map --out ${POP}
plink --bfile ${POP} --recode rlist tab --set-missing-var-ids @:# --out ${POP}.map
done
All my .map files have 0 for every value of the cM column.
Am I doing something wrong? I have .bed files for these populations, but I don't know how to check that phase information is included in them, nor do I know why it wouldn't be there considering that the files are phased.
Thanks for any help anyone can offer.
Reformatting my existing ref maps and combining them all into one file worked. Thanks!
Thanks for responding! This was really helpful as I now think I have the data I need to go off of.
My data is from multiple wild populations of P. reticulata (guppies, a small fish).
I do have recombination maps for guppies in general, but because it looked different from what I'd expected and I found others asking about generating the maps online, I thought I should be able to do it from what I had.
The recombination maps I have access to have a slightly different format, and are smoothed and given per-chromosome for guppies as a whole.
vs the format of a map generated by PLINK
If I can reformat what I might be able to get this to work...