## *.vcf.gz

Files containing the variants that are used in the training process, block-gzipped VCF format. Same as CADD v1.6.

## *.npz

Imputed and transformed training set (directly usable in machine learning setting) in sparse numpy format. See example python code in toplevel directory.
