Concordance Of Ngs Data To Wg Snp Array Data
3
4
Entering edit mode
13.7 years ago
jvijai ★ 1.2k

Hi, We want to check the concordance of the NGS data obtained from targeted SOLiD sequencing with the Affymetrix 6.0 data generated on the same sample. Both are genomic germline DNA, not tumor.
Is there a simple tool that will take the SAM file or variant file and look for concordance with the individuals genotype file (say if extracted from PLINK)?

Also hoping to hear how to do this otherwise, such as looking for the positions and alleles and then doing comparison, tabulating the number of concordant vs discordants etc.,

Thanks

snp microarray next-gen sequencing allele • 5.8k views
ADD COMMENT
4
Entering edit mode
13.6 years ago

the affy6.0 chip has a translation file that has all the typed genome positions tagged, with reference and alternative allele. considering that NGS will only detect how your sample differs from the current reference, taking affy6.0 alleles and positions there are 2 obvious checks you may perform:

  1. check if all the SNPs that affy6.0 has typed as alternative allele WERE FOUND by NGS (hence considered variants).
  2. check if all the SNPs that affy6.0 has typed as reference allele WERE NOT FOUND by NGS (hence considered reference).

keep in mind that genotyping is "limited" to the fixed positions stored in the chip, so you won't be able to do anything in reverse direction, i.e. things that arise from NGS that are not described on the affy6.0 chip. but still this check will be powerful, since the typing quality of affy6.0 is quite high, so at least you'll be able to have ~1M straight checks (SNP positions) and another ~1M not-so-straight checks (CNVs), depending on your NGS experiment type.

ADD COMMENT
4
Entering edit mode
13.1 years ago
Adao ▴ 40

Believe it or not, my PI want me to do this by excel.

ADD COMMENT
0
Entering edit mode

Just have to say wow...

ADD REPLY
0
Entering edit mode

Wow... that's a tough job.

ADD REPLY
0
Entering edit mode

excel really? there's excel templates which work as analysis tools for bioinformatics from the microsoft website. NOT sure if they might have something like this. but conceptually, I think a linux 'join' command is what u need to work fast

ADD REPLY
3
Entering edit mode
13.6 years ago
Adam ★ 1.0k

If in VCF format, you can use the vcftools --diff options to do this. Alternatively, you could convert the format to PLINK and use the diff mode of that program.

ADD COMMENT

Login before adding your answer.

Traffic: 2604 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6