## *.vcf.gz Files containing the variants that are used in the training process, block-gzipped VCF format. Same as CADD v1.6. ## *.npz Imputed and transformed training set (directly usable in machine learning setting) in sparse numpy format. See example python code in toplevel directory.