Goodness of fit of UMAP


I'm given a 20×122 matrix, where each column is a 20-element vector. I apply UMAP to the matrix and get a 2×122 matrix and a UMAP plot. How can I measure the goodness of fit of this UMAP model? Is there a standard way to do that?


There is 1 best solution below

Wuuzzaa On

UMAP serves several purposes, such as preprocessing for clustering, supervised learning, and outlier detection, so the appropriate quality measure depends on the use case.

What exactly do you want to do with UMAP?

For clustering, you can look at scikit-learn's clustering evaluation metrics (for example the silhouette score) and compare the scores with those obtained from other algorithms such as t-SNE.
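A minimal sketch of such a comparison, using only scikit-learn. The synthetic data, the choice of k-means with 3 clusters, and the helper name clustering_score are assumptions made for illustration. The UMAP embedding (for example from umap-learn's UMAP(n_components=2).fit_transform) would be added to the dictionary in the same way; it is left out so the sketch depends only on scikit-learn. Note that scikit-learn expects samples as rows, so a 20×122 matrix with samples as columns would need to be transposed first.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE
from sklearn.metrics import silhouette_score

# Assumption: synthetic stand-in for the real data, 122 samples x 20 features.
X, _ = make_blobs(n_samples=122, n_features=20, centers=3, random_state=0)


def clustering_score(embedding, n_clusters=3):
    """Cluster an embedding with k-means and return its silhouette score."""
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit_predict(embedding)
    # The silhouette score lies in [-1, 1]; higher means better-separated clusters.
    return silhouette_score(embedding, labels)


# Each entry is a 2-D embedding of the same data; a UMAP embedding fits here too.
embeddings = {
    "PCA": PCA(n_components=2, random_state=0).fit_transform(X),
    "t-SNE": TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X),
}

scores = {name: clustering_score(emb) for name, emb in embeddings.items()}
for name, score in scores.items():
    print(f"{name}: silhouette = {score:.3f}")
```

The score here is computed in each embedding's own coordinates, so it says how cleanly the embedding separates into clusters, not how faithful it is to the original 20-dimensional data.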

To look for structure, you can reduce your data to 2 or 3 dimensions and inspect the result visually in a scatter plot.

If you have labeled data, you can train a (nonlinear) classifier such as a random forest on the reduced data and compare the resulting score (e.g. accuracy) with the score obtained after other dimensionality reduction techniques such as PCA.
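One way this comparison might look, as a sketch under stated assumptions: the labeled data is synthetic, and downstream_accuracy is a hypothetical helper name. The reducer is placed inside a pipeline so that it is refit on each training fold. umap-learn's UMAP estimator follows the scikit-learn fit/transform interface, so it should be usable as the reducer in the same way as PCA below.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Assumption: synthetic labeled data, 122 samples x 20 features, 3 classes.
X, y = make_classification(
    n_samples=122, n_features=20, n_informative=5, n_classes=3, random_state=0
)


def downstream_accuracy(reducer, X, y):
    """Mean 5-fold cross-validated accuracy of a random forest on reduced data."""
    model = make_pipeline(
        reducer, RandomForestClassifier(n_estimators=100, random_state=0)
    )
    return cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()


# Baseline: the same classifier on the unreduced 20-dimensional data.
raw_acc = cross_val_score(
    RandomForestClassifier(n_estimators=100, random_state=0), X, y, cv=5
).mean()
pca_acc = downstream_accuracy(PCA(n_components=2), X, y)

print(f"raw features: {raw_acc:.3f}")
print(f"PCA (2 dims): {pca_acc:.3f}")
```

A reducer whose accuracy stays close to the raw-feature baseline has kept most of the class-relevant information.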

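You may be looking for the trustworthiness score from scikit-learn (sklearn.manifold.trustworthiness), which measures how well local neighborhoods of the original data are preserved in the embedding. You can compare the score for PCA with the score for UMAP or any other dimensionality reduction algorithm.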
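A short sketch of the trustworthiness comparison, again on synthetic data chosen for illustration. The value n_neighbors=5 is scikit-learn's default and only an assumed starting point. A UMAP embedding of the same data could be passed as the second argument in exactly the same way.

```python
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE, trustworthiness

# Assumption: synthetic stand-in for the real data, 122 samples x 20 features.
X, _ = make_blobs(n_samples=122, n_features=20, centers=3, random_state=0)

X_pca = PCA(n_components=2, random_state=0).fit_transform(X)
X_tsne = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)

# Trustworthiness lies in [0, 1]; values near 1 mean that points which are
# neighbors in the embedding were also neighbors in the original space.
t_pca = trustworthiness(X, X_pca, n_neighbors=5)
t_tsne = trustworthiness(X, X_tsne, n_neighbors=5)

print(f"PCA:   {t_pca:.3f}")
print(f"t-SNE: {t_tsne:.3f}")
```

Because the score depends on n_neighbors, it is worth checking a few values before concluding that one embedding is better than another.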