Hello all,
I would appreciate some advice. I have generated SNP sequences using DiscsoSNP-RAD and figured out a pipeline to import the data into STRUCTURE to look at population structures. I am trying to perform a similar experiment as Gauthier et al. did in their paper DiscoSnp-RAD: de novo detection of small variants for population genomics. The fasta file produces SNP sequences of upper and lower paths as demonstrated below:
SNP_higher_path_3|P_1:30_C/G|high|nb_pol_1|left_unitig_length_86|right_unitig_length_261| left_contig_length_168|right_contig_length_764|C1_124|C2_0|G1_0/0|G2_1/1|rank_1.00000
SNP_lower_path_3|P_1:30_C/G|high|nb_pol_1|left_unitig_length_86|right_unitig_length_261| left_contig_length_168|right_contig_length_764|C1_0|C2_134|G1_0/0|G2_1/1|rank_1.00000
My question is, should I use only one of the paths or both of the paths for STRUCTURE analysis? If one, which path would be best? I appreciate any feedback!
Thank you in advance
Hello Dr. Gauthier,
Thank you for the information! This is extremely helpful. I appreciate your time and help.