Hi,
The following two sites show substantially different sizes for the files including the integrated calls per chromosome.
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase1/analysis_results/integrated_call_sets/
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase1/analysis_results/integrated_call_sets/
For example, integrated calls for chromosome 1:
ftp: 10/12/12 12:00 [GMT] 4,294,967,295 ALL.chr1.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf.gz
http: 12-Oct-2012 13:52 [GMT] 11G ALL.chr1.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf.gz
Any idea why this is the case?
When I try to download the larger files from the http link, the download is terminated after the first gigabyte with an error message indicating that the source could not be found.
Thank you Laura for this reply as well as the previous comments on the 1000G related questions, I really appreciate your help. In that case, I will refer to the md5 to validate the files that I have downloaded.