Hi all,
I have the illumina paired end reads of four chromosomes in fastq format. Using the cat command, I merged all of the fastq files into one. I used the GATK hard filtering method to extract SNPs in vcf format after mapping with bwa mem. Next, I would like to find the nucleotide diversity, TajimaD, Fixation index etc from the SNPs. For example, find the diversity between chromosomes 1 and 2, chromosomes 1 and 3, chromosomes 1 and 4. I would like to know whether this approach is correct. How do I find the above parameters from vcf files or any other methods possible?
This question was also asked on bioinformatics SE: https://bioinformatics.stackexchange.com/questions/22663/nucleotide-diversity-and-tajimad-from-snps
Please keep in mind that posting the same question to multiple sites can be perceived as bad etiquette, because efforts may be made to address a problem that has already been solved elsewhere in the meantime.
The helpful thing to do if you do decide to post on multiple forums is to add a link to the other forum posts on each post so people will look at the other posts before investing their effort.