I have a large set of preference data that is expressed as 1.0, and I use the Tanimoto affinity functions and the common logical commands for user and item preferences. Recommendations typically represent values from 0 to 1.0.
Many sources, such as the Mahout in Action book and this previous SO thread , recommend the LogLikelihoodSimilarity metric for Tanimoto for Boolean datasets. When I switched to the LogLikelihood affinity metric, it generated a few points in a much higher range, for example 11. I had to go back to Tanimoto to get more sensitive grades. Can you suggest any potential fixes, or am I misunderstanding the return values of recommended results?
source
share