I'm trying to work with files downloaded from ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521
What tools do I need to work with these files? I believe it is vcftools and tabix, right? The documentation is not clear at all.
I'm trying to work with files downloaded from ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20110521
What tools do I need to work with these files? I believe it is vcftools and tabix, right? The documentation is not clear at all.
http://www.1000genomes.org/using-1000-genomes-data
look at the file intitle "The 1000 Genomes Tools"
It's pretty clear.
I'm going to be working with SNPs. To clarify, this index has the SNP files, correct? ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20110521/
As you can see from the names, the file
ALL.wgs.phase1_release_v3.20101123.snps_indels_sv.sites.vcf.gz
only contains the variant sites, whereas the files
ALL.chr1.phase1_release_v3.20101123.snps_indels_svs.genotypes.vcf.gz
etc.
contain all the genotypes for the 1000 Genomes individuals.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Maybe you could explain what you're trying to achieve? "Working with files" sounds very vague.
it depends on what you want to do. VCFtools will be good for most things.
Do I have to unzip the .vcf file before using vcftools?