## *.vcf.gz

Variants used in the training process, in block-gzipped VCF format.
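
A minimal sketch of how such a file could be read with the Python standard library alone. The names `Variant` and `read_variants` are hypothetical and not part of this repository. The sketch relies only on the fixed VCF column order (CHROM, POS, ID, REF, ALT, ...) and on the fact that block-gzipped files can be read by an ordinary gzip reader.

```python
import gzip
from typing import Iterator, NamedTuple


class Variant(NamedTuple):
    """Hypothetical container for the core fields of one VCF record."""
    chrom: str
    pos: int
    ref: str
    alt: str


def read_variants(path: str) -> Iterator[Variant]:
    """Yield one Variant per data line of a (block-)gzipped VCF file."""
    # A block-gzipped file is a series of gzip members, which gzip.open reads transparently.
    with gzip.open(path, "rt") as handle:
        for line in handle:
            # Skip blank lines, "##" meta-information lines and the "#CHROM" header line.
            if not line.strip() or line.startswith("#"):
                continue
            fields = line.rstrip("\n").split("\t")
            # Fixed VCF columns: 0=CHROM, 1=POS, 2=ID, 3=REF, 4=ALT
            yield Variant(fields[0], int(fields[1]), fields[3], fields[4])
```

For indexed or region-based access, a dedicated VCF library is likely a better fit than this line-by-line reader.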

## *.tsv.gz

Variants annotated with features, in block-gzipped TSV format.

## *.csv.gz

Imputed and transformed training set. The first column is the target Y; the remaining columns form the feature matrix X and can be used directly in any machine learning setting.
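
As an illustration, the split into Y and X could look like the sketch below. The function name `load_training_set` is hypothetical. The sketch assumes comma-separated, purely numeric values. Whether the file starts with a header row is not stated here, so that is exposed as the `has_header` parameter.

```python
import csv
import gzip
from typing import List, Tuple


def load_training_set(
    path: str, has_header: bool = False
) -> Tuple[List[List[float]], List[float]]:
    """Return (X, y), where y is the first column and X the remaining columns."""
    features: List[List[float]] = []
    labels: List[float] = []
    with gzip.open(path, "rt", newline="") as handle:
        reader = csv.reader(handle)
        if has_header:
            next(reader, None)  # assumption: a single header row, if present
        for row in reader:
            if not row:  # ignore empty lines
                continue
            values = [float(v) for v in row]
            labels.append(values[0])     # first column: Y
            features.append(values[1:])  # remaining columns: X
    return features, labels
```

The returned lists can be converted to arrays or passed to most machine learning libraries as they are.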
