#[2] What does nostrich/universe collecting Japanese text with lang=ja ? In my short trying, it seems to be included when the number of Japanese characters is greater than the number of alphanumeric characters.

Reply to this note

Please Login to reply.

Discussion

It uses FastText to do the classification. I think it depends on the training model.

Ah, I see. I'll switch the relay to relay-jp.nostr.wirednet.jp for the bot. Thank you.

The classification effect of long articles will be better, and short content needs to be pre-processed first.