Entering edit mode
2.5 years ago
Nai
▴
50
For association study what type of data file will be needed? I have Chromosome, position, Allele information, snp id. My queries are:
- To calculate allele frequency
- What should I consider case and control. If I have disease based parameters?
- What will be the data file in plink
- How can I calculate ODDS ratio
Dear Pedro,
I don't have vcf file. I have case and control with growth parameters, rsid, chrom, position. I know the steps like QC which should give output Max Allele Frequency, frequency estimation followed with HWE (OR ratio and pvalue) and logistic regression. I would like to know how to read text file in plink. Plink shows --file mydata.
Then you can code a script to convert manually to tped format
The tabular file should contain 4+2n columns, where n = number of samples. The first 4 columns should be Chromosome, Variant ID, Genomic position (in centimorgans) and Base-pair position. The next pair are the calls for the first ind in the tfam file, the next pair the calls for the second ind, etc.