Combined Annotation Dependent Depletion (CADD)

July 29, 2015: Webserver v1.3 submissions including InDels and annotations rerun

We were kindly informed that webform submissions including InDels while requesting results including annotations were missing the InDels for v1.3. The source of the incomplete files was a typo in one of the cluster submission scripts, causing the commands to not be executed. We reran these submissions and ask users to download their files again. If you do not have your download link available, upload your file again and you will be immediately directed to the download page.

July 11, 2015: Scores and scripts for CADD v1.3 released

In CADD v1.3 (3rd developmental/minor release), we fix some minor issues identified with CADD v1.1 and v1.2 related to overlapping gene annotation. This release is still based on the GRCh37/hg19 genome build and we are missing many annotations for the new genome build, not allowing us to release a version for GRCh38 at this point. As for CADD v1.1 and v1.2, many of the results for the validation sets are similar or better than for our last major release (CADD v1.0). We would like to highlight again that there is a measurable reshuffling of variant ranks to the last major release and we see differences in the score ranges that are obtained for certain predicted functional consequences compared to the last major release. Please find further information in the release notes.

For scoring your variants locally with CADD v1.3, we are providing the required annotation tracks (same as for CADD v1.2 with one update), an updated set of primate whole genome alignments and set of scripts. Provided pre-scored variant sets of CADD v1.3 can be used with these scripts to first retrieve pre-scored variants without recalculating their scores. Please check the installation instruction detailed in the README provided with the scripts (132K), before downloading all annotations (206G).

Jan 21, 2015: Scores and scripts for CADD v1.2 released

In CADD v1.2 (2nd developmental/minor release), we fix some minor issues identified with CADD v1.1. CADD v1.2 is still based on the GRCh37/hg19 genome build. As for CADD v1.1, many of the results for the validation sets are similar or better than for our last major release (CADD v1.0). However, we would like to highlight that there is a measurable reshuffling of variant ranks between versions and we see differences in the score ranges that are obtained for certain predicted functional consequences. Please find further information in the release notes.

For scoring your variants locally with CADD v1.2, we are providing the required annotation tracks (same as for CADD v1.1) and a set of scripts. Provided pre-scored variant sets of CADD v1.2 can be used with these scripts to first retrieve pre-scored variants without recalculating their scores. Please check the installation instruction detailed in the README provided with the scripts (132K), before downloading all annotations (206G).

Jan 1, 2015: Scripts and annotations for CADD v1.1 released

If you want to locally score variants without using our webserver, we are now providing the required annotation tracks and a set of scripts for scoring your variants locally with CADD v1.1. You can use the provided pre-scored variant sets to first retrieve pre-scored variants using the provided scripts without recalculating their scores. Please check the installation instruction detailed in the README provided with the scripts (132K), before downloading all annotations (206G). Note that v1.1 has a known bug which will be fixed with the release of CADD v1.2.

Dec 19, 2014: Bug in v1.1 release identified

When multiple SNV variants at the same site are scored, DNA structure predictions for the first variant are considered for all variants in v1.1. This causes the scores distributed in the whole genome SNV files to be off (as three alleles were considered for each position during scoring). This effects the conversion of unscaled to scaled scores. It also causes score differences between the 1000 Genomes, ESP, and ExAC files when compared to the whole genome file. We will fix this in a v1.2 release.

Dec 2, 2014: CADD v1.1 release

In CADD v1.1 (developmental/minor release), we use a slightly extended and updated annotation set and we trained a logistic regression model using GraphLab Create v0.9.1. CADD v1.1 is still based on the GRCh37/hg19 genome build. Even though results reported for many of our previously used validation sets are similar or better with this release, we would like to highlight that there is a measurable reshuffling of variant ranks between versions. We also see differences in the score ranges that are obtained for certain predicted functional consequences. Please find further information in the release notes.

Nov 5, 2014: Scores for ExAC r0.1 variants available for download

ExAC variant scores are available in the download section. These variants were downloaded from ftp://ftp.broadinstitute.org/pub/ExAC_release/release0.1/. Further information about the Exome Aggregation Consortium (ExAC) and the availability of these variants is available at http://exac.broadinstitute.org/

Oct 14, 2014: CADD is running on a new webserver

CADD has been running on a virtual webserver for some time. We have now moved to a new physical webserver with fast local storage. This will allow for faster downloads of the large files that we are providing.

Aug 4, 2014: Set of pre-scored InDels released

If you need to score many variants or score variants on a regular basis, you can download all possible SNVs and this set of InDels and then only score new InDels through the webinterface. In this case, we recommend that you start building a local database of InDels that you previously scored and check against these before running new variants. To initiate a local database of CADD scores, we provide a set of 12.3M InDels for download.

Feb 2, 2014: CADD manuscript is published

Our manuscript describing the method and features of CADD v1.0 was published by Nature Genetics:

Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM, Shendure J.
A general framework for estimating the relative pathogenicity of human genetic variants.
Nat Genet. 2014 Feb 2. doi: 10.1038/ng.2892.
PubMed PMID: 24487276.

Oct 15, 2013: CADD webserver goes online

© University of Washington and Hudson-Alpha Institute for Biotechnology 2013-2014. All rights reserved.

 

Terms and Conditions and the Online Privacy Statement of the University of Washington apply.