How can I get the similarity matrix from minhash LSH?

736 Views Asked by z3r0 At 04 January 2018 at 14:01

I have read many tutorials and tried a number of minhash LSH, but it cannot generate the similarity matrix, instead it returns just similar data which exceeds the threshold. How can I generate it? My intention is to use the LSH results for clustering.

Original Q&A

There are 1 best solutions below

Has QUIT--Anony-Mousse On 05 January 2018 at 09:38 BEST ANSWER

The whole point of LSH is to avoid pairwise distances, because that does not scale.

If you then put the data into a distance matrix, you get all the scalability problems again!

Instead consider an algorithm like DBSCAN clustering. It doesn't need a distance matrix, only neighbors at distance epsilon.

How can I get the similarity matrix from minhash LSH?

There are 1 best solutions below

Related Questions in CLUSTER-ANALYSIS

Related Questions in LOCALITY-SENSITIVE-HASH

Related Questions in MINHASH

Trending Questions

Popular # Hahtags

Popular Questions