Skip to content
This repository was archived by the owner on Apr 22, 2020. It is now read-only.

Conversation

oschlueter
Copy link

The current implementation of Jaccard-similarity doesn't discard dupliate input values when calculating the denominator. I identified this issue by calculating Jaccard on identical input containing duplicates which didn't return 1.0 for which I added test cases.

@oschlueter
Copy link
Author

When creating the test cases I forgot to set the call to Similarities::overlapSimilarity. Once fixed I saw that the calculation is also affected so I added a propsed fix for overlapSimilarity as well.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

1 participant