What do phase 1 and phase 2 mean?
The full 1000 Genomes Project has been divided into phases to reflect the dispersed nature of the sample collection.
Phase 1 covers the low coverage and exome data analysis for the first 1,000 or so samples. The phase 1 low coverage data freeze is represented by the 20101123 sequence index, and the phase 1 exome data freeze by the 20110521 sequence index. Any variant calls made for phase 1 will be presented in the release directory of the FTP site.
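Since each data freeze is identified by a sequence index, one way to see which samples a freeze contains is to tally the samples listed in that index file. The following is a minimal sketch only, assuming the index is a tab-delimited file with a header row containing the columns SAMPLE_NAME, ANALYSIS_GROUP and WITHDRAWN (with "1" marking withdrawn runs). These column names, the function name samples_per_group and the file path in the usage line are assumptions for illustration; check them against the header of the index you download.

```python
import csv
from collections import defaultdict


def samples_per_group(index_path):
    """Count distinct samples per analysis group in a sequence index file.

    Assumes a tab-delimited file with a header row that includes the
    columns SAMPLE_NAME, ANALYSIS_GROUP and WITHDRAWN (assumed layout).
    """
    groups = defaultdict(set)
    with open(index_path, newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        for row in reader:
            # Skip runs flagged as withdrawn (assumed to be marked "1").
            if row.get("WITHDRAWN") == "1":
                continue
            groups[row["ANALYSIS_GROUP"]].add(row["SAMPLE_NAME"])
    # A sample usually has many runs, so count unique names, not rows.
    return {group: len(names) for group, names in groups.items()}


if __name__ == "__main__":
    import sys

    # Usage: python count_samples.py path/to/sequence.index
    if len(sys.argv) > 1:
        for group, count in sorted(samples_per_group(sys.argv[1]).items()):
            print(group, count)
```

Run against a local copy of an index, this prints one line per analysis group with the number of distinct samples that have at least one run that is not withdrawn.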
Phase 2 represents an expanded set of around 1,500 samples. The new samples have been assigned to the sequencing centres; the sequence data is expected to appear in the last quarter of 2011, with the analysis following in the first half of 2012. The phase 2 analysis will include all samples from phase 1 and will build on lessons learnt during the phase 1 analysis process.