Here are the frequencies for the spammiest tokens on the damus relay if anyone wants to use these for spam filtering:

https://cdn.jb55.com/nostr/spammy.txt

Reply to this note

Please Login to reply.

Discussion

I feel sorry about this

Crap that’s a long list

Will do you consider using machine learning instead of key word to filter?

So many single word ones

Yeah, I know. I presume the presence of one word on the list is going to be insufficient to tag a post as spam.

It's tokenized, so it's a count per word of all notes, not complete notes.

I can see so many Chinese words.😂

They may be testing the robustness of the nostr system.

Chinese is king of spam lol

Absolutely, Chinese people are creative. 😂

这样很难防住中文垃圾信息的,我们得帮 #[3] 想想办法。

只能用relay過濾

实际上,我觉得全球频道是无用的。它只是获得新信息和新朋友的其中一种方式。

我们应该思考的是:获得新信息的方式都有哪些?如果有更好的方式可以获得新信息和新朋友,那么全球频道是否可以删除掉?如果不删除,是否可以做些限制(比如只有经过NIP05验证的用户才能在全球频道发帖)?

This shows how #Nostr is substantially prized by the Chintoks, proof that "communism" is only a barrier to the emancipation of a ‘Chinese bitcoiner’ & that only #Nostr protocol could start a symbolic fight against the desire for communist collaboration leading only to the isolation of Asia-minor with regard to "decentralization & the possible adoption of Bitcoin"..

哈哈哈哈,用关键词来屏蔽吗?很快你们就会尝到火星文的厉害了。🐶

不得不说,国内这个spam和anti-spam行业太发达了,领先业界一百年。

哈哈哈哈哈哈,没想到还能输出火星文

中文spam 真是臭名远扬🤡

这是一场没有尽头的较量

I prefer just IP blocking instead of keyword filtering for this reason.

垃圾人群。没灵魂,没思想。奴隶贱民只会搞这些垃圾事。

spammer是骂不死的👀

或者说,他们根本不看nostr。

用机器人来发“色情”垃圾资讯,污染别人,污染软件体验环境,这些人太邪恶了。

只能关注任意一位正常用户,再遵循他的列表关注了。

关掉Global就可以了,damus上可以选择中继,amethyst上可以关global流量

do spammers realize no one is seeing their messages? why do they bother?

他们很多广告是诈骗赌博之类的,净利润很高。

只要找到一个客户,可能发一年的spam都不亏。

Global 的牛皮癣垃圾还是有能各种办法治理的, 就担心后面那种 评论、 私信、 伪装、 钓鱼的骗子各种通知会很打扰正常用户

That’s done on the relay side right?

#[0]

Is this similar with relay.nostrgraph.net , nos.lol , nostr.mom have found till now #[2] #[3] ?

I haven't checked the word frequencies. If anyone needs transparency I might do a bot in the long term that 'lightly' publishes the spam filter decisions as events on nos.lol. Obviously not every event rejected is going to be published about.. Fresh pubkeys will probably won't be mentioned at all.

there is no ends

These Chinese user groups.

It's really rubbish.

It's all a group of liars.

Cheating money.

Lose the face of the Chinese people. They are all low-quality garbage people.

Originally, Chinese people used American software to exchange ideas, do meaningful things, and do valuable things.

But it was ruined by this group of soulless garbage Chinese.

😤😡

What if we compiled a similar token frequency table for non-spam? Could that be “subtracted” from this table for an even more useful table?

🤷‍♂️

what do you think the spammers will do with that table?

#[0]

Can I filter spam from amethyst?

I noticed that the spammers basically have no followers, or less than 10.

We can use this feature to restrict people who post on public channels. If someone wants to post on a public channel, he must have more than 10 followers.

基本上都是我们中国人,真的很让人失望!

#[1]​ have you considered adding more structured ways of providing these feedback loops? For example ability to rate nip05 verifications and nip58 badges?

That way people can organically assess the trustworthiness of these forms of verification real time