Where are your sequence files located?

Our sequence files and our alignment files are all located underneath the data directory on the ftp site. There are indexes both data sets at the route of the ftp site sequence.index and alignment.index. Dated versions of every index released are also found in the sequence and alignment_indices directories.

Main releases of our variants can be found under release in directories named in the form YYYYMMDD. These dates should match the date of the sequence and alignment index the release is based on. There are old directories in the form YYYY_MM as previously we have named directories for the month they were released in. There will also be intermediate variant calls found under technical/working on the ftp site but please be careful with any data you find here as it is likely to be experimental and subject to change.

Our reference data sets can be found in technical/reference/ and this includes items like the reference genome, ancestral alignments and standard annotation sets.

A frozen set of the pilot data is also available under pilot_data. This follows the same structure as the main ftp site and also includes reference data sets for the pilot study publications.

All phase 1 BAMs have been moved to a phase1 freeze directory here

The trio high coverage data (pilot2) mapped to GRCh37 have been moved to here

The exon-targetted data (pilot3) mapped to GRCh37 have been moved to here.

You can also find files on our ftp site by using our ftpsearch