The Phase 3 publication, A global reference for human genetic variation and the Phase 3 Structural variation publication, An integrated map of structural variation in 2,504 human genomes are now available from Nature alongside a celebration of 25 years of the Human Genome Project
Both these papers are open access and should be free for everyone to read and download.
If you have any questions about the data these papers are based on or how to access it please email firstname.lastname@example.org
Recent project announcements
We have realigned the low coverage 1000 Genomes sequence data to GRCh38. We aligned to the full assembly including the GRC maintained alternate loci sequences and decoy and additional HLA sequences from the IMGT, our fasta file can be found in our reference directory. The alignment was carried out using a new alt-aware version of BWA-mem
The alignment files themselves can be found in the data_collections/1000_genomes_project/data directory.
Please note, these files are now being distributed in CRAM format, rather than BAM format. You can find more details about CRAM in this README
Full details of our alignment pipeline can be found in the alignment pipeline README
If you have any questions please email email@example.com
#ASHG2015 The 1000 Genomes Data Tutorial - 8th October, 7-9pm Baltimore Convention Centre, Room 327, Level 3
The 1000 Genomes Project is holding a tutorial giving and overview of the 1000 Genomes Project, how to access the data and explaining different use cases for the data.
This tutorial will be held in the Baltimore Convention Centre, Room 327, Level 3.
For more details about the program please see the tutorial page
No registration is needed.
Email firstname.lastname@example.org if you have any questions.
The 1000 Genomes Project is an international collaboration to produce an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts. This resource will support genome-wide association studies and other medical research studies.
The genomes of about 2500 unidentified people from about 25 populations around the world will be sequenced using next-generation sequencing technologies. The results of the study will be freely and publicly accessible to researchers worldwide.
Further information about the project is available in the About tab. Information about downloading, browsing or using the 1000 Genomes data is available in the Data tab.