Hello Pierre
Thank you for your reply. It was helpful. However, I am still confused in interpreting "./."
I have 50 samples listed in the .fof file. Upon completion, discoSNP++ (followed by vcfcreator) outputs a contig fasta file and a vcf file. The vcf file contains numerous rows, each corresponds to single variant, and 9 + 50 columns. These 50 columns corresponds to the variant information within 50 samples used. Now take an example row from the output vcf file:
SNP_higher_path_9480770 56 9480770 C T . . Ty=SNP;Rk=1;UL=6;UR=20;CL=.;CR=.;Genome=.;Sd=. GT:DP:PL:AD:HQ ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 0/0:11:5,37,224:11,0:66,0 ./.:1:.,.,.:1,0:68,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:5:.,.,.:0,5:0,63 ./.:0:.,.,.:0,0:0,0 1/1:1259:25184,3794,59:0,1259:0,66 ./.:0:.,.,.:0,0:0,0 1/1:43:864,134,6:0,43:0,65 1/1:38:764,119,6:0,38:0,64 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 1/1:34:684,107,6:0,34:0,66 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0 ./.:0:.,.,.:0,0:0,0
Here you see that most columns have "./." and some have "1/1".
Now my question is how should I interpret samples with genotype "./."? Should I interpret is as the contig "SNP_higher_path_9480770" is missing in this particular sample OR the contig is present but without any variation?
Hope you get my query. Thanks
./.
(for read seti
): the variant whose id is SNP_higher_path_9480770 has not enough corresponding reads in the read seti
.not enough means that both alleles are not read coherent (cf read coherent definition in the publication)
Pierre