You could also do unique word comparisons with a similar weight matrix that might be faster?
Strip a note down to its set of unique words. Then compare the sets in a matrix of all notes compared to all other notes in the matrix, Count the words in common between two sets. Divide the count by the number of unique words or something else to normalize.
There are a lot of directions you could go in.