Jaccard Similarity vs Cosine Similarity
https://datascience.stackexchange.com/questions/5121/applications-and-differences-for-jaccard-similarity-and-cosine-similarity Jaccard Similarity is given by s i j = p p + q + r s i j = p p + q + r where, p = # of attributes positive for both objects q = # of attributes 1 for i and 0 for j r = # of attributes 0 for i and 1 for j Whereas, cosine similarity = A ⋅ B ‖ A ‖ ‖ B ‖ A ⋅ B ‖ A ‖ ‖ B ‖ where A and B are object vectors. Simply put, in cosine similarity, the number of common attributes is divided by the total number of possible attributes. Whereas in Jaccard Similarity, the number of common attributes is divided by the number of attributes that exists in at least one of the two objects. And there are many other measures of similarity, each with its own eccentricities. When deciding which one to use, try to think of a few representative cases and work out which index would give the most usable results to achieve your objective. The Cosine inde...