no code implementations • 3 Jul 2023 • Chandrika Kamath, Juliette S. Franzman
We show how the randomness in both the random projections and the choice of initial centroids in k-means clustering can be used to determine the number of clusters in our data set.
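One way to read this idea is as a stability check. The sketch below is an illustrative assumption, not the authors' procedure. For each candidate k, it reruns k-means on fresh random projections with fresh random initial centroids, then measures how well the resulting labelings agree. The helper names (`random_projection`, `kmeans_labels`, `rand_index`, `stability_by_k`), the Gaussian projection matrix, and the use of the plain Rand index as the agreement measure are all hypothetical choices made for this example.

```python
import random

def random_projection(points, out_dim, rng):
    """Project points to out_dim dimensions with a Gaussian random matrix."""
    in_dim = len(points[0])
    matrix = [[rng.gauss(0.0, 1.0) / out_dim ** 0.5 for _ in range(in_dim)]
              for _ in range(out_dim)]
    return [[sum(w * x for w, x in zip(row, p)) for row in matrix] for p in points]

def kmeans_labels(points, k, rng, iters=25):
    """Lloyd's algorithm with centroids initialised at k random data points."""
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        for i, p in enumerate(points):
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            labels[i] = dists.index(min(dists))
        # Update step: keep the old centroid if a cluster empties out.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def rand_index(a, b):
    """Fraction of point pairs that two labelings treat the same way."""
    n = len(a)
    agree = 0
    for i in range(n):
        for j in range(i + 1, n):
            agree += (a[i] == a[j]) == (b[i] == b[j])
    return agree / (n * (n - 1) / 2)

def stability_by_k(points, k_values, out_dim=2, runs=6, seed=0):
    """Mean pairwise agreement of repeated randomised runs, for each k."""
    rng = random.Random(seed)
    scores = {}
    for k in k_values:
        # Each run draws a new projection and new initial centroids.
        labelings = [kmeans_labels(random_projection(points, out_dim, rng), k, rng)
                     for _ in range(runs)]
        pairs = [(i, j) for i in range(runs) for j in range(i + 1, runs)]
        scores[k] = sum(rand_index(labelings[i], labelings[j])
                        for i, j in pairs) / len(pairs)
    return scores

if __name__ == "__main__":
    # Toy data (hypothetical): three tight blobs in five dimensions.
    data_rng = random.Random(1)
    data = [[data_rng.gauss(c, 0.1) for _ in range(5)]
            for c in (0.0, 5.0, 10.0) for _ in range(15)]
    for k, score in stability_by_k(data, [2, 3, 4, 5]).items():
        print(k, round(score, 3))
```

Under this reading, values of k whose labelings stay consistent across runs are candidates for the number of clusters. The plain Rand index is not corrected for chance, so scores at different k should be compared with that in mind.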
no code implementations • 3 Jul 2023 • Chandrika Kamath, Juliette S. Franzman, Brian H. Daub
The characteristics of our data set are unique: the vector-valued outputs from each simulation are available at over two million spatial locations; each simulation is run for a relatively small number of time steps; the size of the computational domain varies with each simulation; and resource constraints limit the number of simulations we can run.