neriticzone t1_j2iln01 wrote

I'm not sure I understand the question, but isn't this what the Dice coefficient is used for?
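
For reference, here is a minimal sketch of the Dice coefficient, assuming the two things being compared can be represented as sets of items. The function name `dice_coefficient` and the convention of returning 1.0 for two empty sets are assumptions made for illustration, not something stated in the thread.

```python
def dice_coefficient(a, b):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two collections of hashable items."""
    a, b = set(a), set(b)
    if not a and not b:
        # Assumption: two empty sets are treated as identical.
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


# Two sets sharing 2 of their 3 elements each: 2 * 2 / (3 + 3)
score = dice_coefficient({1, 2, 3}, {2, 3, 4})
```

The score ranges from 0.0 (no overlap) to 1.0 (identical sets), so on its own it measures similarity but does not say where the cutoff for "similar" should be.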

uwashingtongold OP t1_j2ilqes wrote

Sure, but I want to see if there’s a method to establish similarity, not just to measure it.

uwashingtongold OP t1_j2ilrbt wrote

Like a significance threshold, maybe.

Mental-Swordfish7129 t1_j2llbwf wrote

What if you encode the data as high-dimensional binary vectors and use a sparse distributed memory? I've used this approach many times in models I've built. You can measure the semantic (Hamming) distance between data points, and you get a latent space describing what similar data would have to look like. It's similar to a self-organizing map approach.
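
A rough sketch of the Hamming-distance part of this idea, using only the standard library. It does not implement a sparse distributed memory. It only shows that in a high-dimensional binary space, a noisy copy of a vector stays much closer to the original than an unrelated random vector does. The dimension, the number of flipped bits, and all function names here are assumptions chosen for illustration.

```python
import random


def random_hypervector(dim, rng):
    """Return a random binary vector of length dim as a list of 0/1 ints."""
    return [rng.randint(0, 1) for _ in range(dim)]


def flip_bits(vec, n_flips, rng):
    """Return a copy of vec with n_flips distinct positions inverted."""
    noisy = list(vec)
    for i in rng.sample(range(len(vec)), n_flips):
        noisy[i] ^= 1
    return noisy


def hamming_distance(u, v):
    """Count the positions at which two equal-length binary vectors differ."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum(x != y for x, y in zip(u, v))


rng = random.Random(0)   # fixed seed so the run is repeatable
DIM = 10_000             # hypothetical dimensionality

a = random_hypervector(DIM, rng)
a_noisy = flip_bits(a, 1_000, rng)   # 10% of the bits corrupted
b = random_hypervector(DIM, rng)     # unrelated vector

d_similar = hamming_distance(a, a_noisy)   # exactly 1000 by construction
d_random = hamming_distance(a, b)          # concentrates near DIM / 2
```

Because distances between unrelated random vectors cluster tightly around half the dimension, a distance well below that band is one possible basis for a cutoff between "similar" and "not similar".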