The way Whitecat-san classifies words and Mattn-san classifies words in Buzzword trends is a bit different I think
コードアップしました
https://github.com/WhiteCat6142/nostr-wordcloud
nostr:note1de82jwxjz0ven0ezxah9ewnu24y0kk6te4n8mjndjy48vgjl68zse7wsze
Discussion
I don't understand Japanese well but I think identifying words in Japanese seem tricky
Once you got it mastered, it’s so legible since Japanese has the three kinds of characters, kanii, hiragana, and katakana ;-)
My buzzword bot treats consecutive nouns as a single word. For example, '自己', '皇帝' and '感' are treated as one word '自己肯定感'. But wordcloud treats them as each words.
Thank you for explaining ☺️
So its just a matter of choice 👍 btw, I'm curious, what does Mattn-san and Whitecat-san use to classify words as nouns? In English, long back Stanford's Wordnet used to be popular for that