Why does the LogLikelihoodSimilarity function return values ​​greater than 1.0 for a dataset of 0s and 1s?

I have a large set of preference data that is expressed as 1.0, and I use the Tanimoto affinity functions and the common logical commands for user and item preferences. Recommendations typically represent values ​​from 0 to 1.0.

Many sources, such as the Mahout in Action book and this previous SO thread , recommend the LogLikelihoodSimilarity metric for Tanimoto for Boolean datasets. When I switched to the LogLikelihood affinity metric, it generated a few points in a much higher range, for example 11. I had to go back to Tanimoto to get more sensitive grades. Can you suggest any potential fixes, or am I misunderstanding the return values ​​of recommended results?

+3
source share
1 answer

, , . , 1.0 . , . [0,1] - .

+2

All Articles