distribution of the number of clusters across samples
distribution of the number of unique samples per cluster
distribution of cluster sizes
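
The three distributions above could be computed along these lines. This is a minimal sketch, not the original implementation: it assumes the clustering is available as (cluster_id, sample_id) pairs, one pair per cluster member, and the function name `cluster_distributions` and the output keys are hypothetical.

```python
from collections import Counter, defaultdict


def cluster_distributions(members):
    """Summarize a clustering given as (cluster_id, sample_id) pairs.

    Assumption: each pair is one member of a cluster, tagged with the
    sample it came from. Each returned Counter maps a value to the
    number of times it occurs (a histogram).
    """
    samples_in_cluster = defaultdict(set)  # cluster -> distinct samples
    clusters_in_sample = defaultdict(set)  # sample -> distinct clusters
    cluster_size = Counter()               # cluster -> number of members

    for cluster_id, sample_id in members:
        samples_in_cluster[cluster_id].add(sample_id)
        clusters_in_sample[sample_id].add(cluster_id)
        cluster_size[cluster_id] += 1

    return {
        # how many samples have k clusters
        "clusters_per_sample": Counter(
            len(c) for c in clusters_in_sample.values()
        ),
        # how many clusters span k distinct samples
        "unique_samples_per_cluster": Counter(
            len(s) for s in samples_in_cluster.values()
        ),
        # how many clusters have k members
        "cluster_sizes": Counter(cluster_size.values()),
    }
```

Each Counter can be passed to a plotting routine as a histogram, or sorted by key for a text summary.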