Nostr Web Client

I’m thinking I’ll work on some type of npub authenticity rating next. Don’t have it figured out yet, but planning to use some of these:

- How repetitive is their kind 1 content?

- Kind distribution

- Tag distribution (only replies vs. only root posts, mentions; what is a normal mix?)

- Age of account

- Age of oldest follow, age of newest follow and the delta

- Centralized NIP-05 provider w/ bad reputation

- Profile very similar to existing one (impersonators)

- Mute lists

- Kind 1984 reports (spam only)

- Link in every post? Known bad links?
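
A rough sketch of how a few of these signals (kind distribution, repetitiveness, reply mix, links) could be computed from an account's events. It assumes events are plain dicts with the standard NIP-01 fields (`kind`, `content`, `tags`); the function name `account_signals` and the choice to count any note with an `e` tag as a reply are illustrative assumptions, and nothing here decides what counts as a "normal" value.

```python
import re
from collections import Counter

URL_RE = re.compile(r"https?://\S+")

def account_signals(events):
    """Compute a few rough per-account signals from NIP-01 event dicts."""
    kinds = Counter(e["kind"] for e in events)
    notes = [e for e in events if e["kind"] == 1]
    total = len(notes)
    if total == 0:
        return {"kinds": dict(kinds), "repetition": 0.0,
                "reply_ratio": 0.0, "link_ratio": 0.0}

    # Repetition: share of notes that duplicate another note after
    # lowercasing and collapsing whitespace.
    normalized = [" ".join(n["content"].lower().split()) for n in notes]
    repetition = 1 - len(set(normalized)) / total

    # Assumption: a kind 1 note with at least one "e" tag is a reply.
    # This is only an approximation, since "e" tags are also used for mentions.
    replies = sum(
        1 for n in notes
        if any(t and t[0] == "e" for t in n.get("tags", []))
    )

    # Share of notes that contain at least one http(s) link.
    links = sum(1 for n in notes if URL_RE.search(n["content"]))

    return {
        "kinds": dict(kinds),
        "repetition": repetition,
        "reply_ratio": replies / total,
        "link_ratio": links / total,
    }
```

A link ratio near 1.0 would correspond to the "link in every post" case; how to weight each signal is left open.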

Other ideas?

ridsdszltdel 2y ago

I think you could also add some gibberish detection (nonsense random words or random strings). I have seen some models and Spaces on Hugging Face for this. It could help detect unsophisticated spam bots that use random characters or random nonsense words.
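
To make the idea concrete, here is a very crude heuristic, not the Hugging Face models mentioned above. It assumes Latin-alphabet text and only flags random-character strings: a token counts as suspicious if it has no vowel or contains a run of five or more consonants. The names `gibberish_score` and `looks_like_word` are made up for this sketch, and it will not catch spam built from random real words.

```python
import re

VOWELS = set("aeiouy")
# Five or more consecutive consonants (a, e, i, o, u, y excluded).
CONSONANT_RUN = re.compile(r"[b-df-hj-np-tv-xz]{5,}")

def looks_like_word(token):
    """True if the token has a vowel and no long consonant run."""
    return any(c in VOWELS for c in token) and not CONSONANT_RUN.search(token)

def gibberish_score(text):
    """Fraction of alphabetic tokens (length >= 4) that do not look like words."""
    tokens = [t for t in re.findall(r"[a-z]+", text.lower()) if len(t) >= 4]
    if not tokens:
        # Nothing to judge (short or non-Latin text): treat as not gibberish.
        return 0.0
    bad = sum(1 for t in tokens if not looks_like_word(t))
    return bad / len(tokens)
```

A score close to 1.0 suggests random characters. Real words with long consonant clusters can produce false positives, so this would only make sense as one input among several.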


Discussion

Mazin 2y ago

Good one, thank you. I looked into this a bit a few months back when some kid was experimenting with randomly generated spam, but never implemented anything.

It might be that we end up making this so complicated that it makes more sense to just label some data and train our own model instead of managing infinite rules. Fortunately, that's one of the many things nostr:npub1qlkwmzmrhzpuak7c2g9akvcrh7wzkd7zc7fpefw9najwpau662nqealf5y can handle if we get there.

ridsdszltdel 2y ago

Yes, you could probably make it a separate, independent service (plugin) instead of rules in the strfry policy, especially if it takes more than one second to classify one event's content. Let the event come in first, process events in a queue, and delete it later if it is detected as high-probability gibberish spam.
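
A minimal sketch of that accept-first, classify-later flow, using only the Python standard library. The `classify` and `delete_event` callables and the `threshold` value are hypothetical placeholders: how events reach the queue and how a deletion is actually issued to the relay are not shown and would depend on the setup.

```python
import queue
import threading

def run_worker(events, classify, delete_event, threshold=0.9):
    """Drain `events`, deleting any event scored at or above `threshold`.

    `classify(text)` returns a spam score in [0, 1] and
    `delete_event(event_id)` removes the event from the relay;
    both are supplied by the caller (hypothetical here).
    """
    while True:
        event = events.get()
        if event is None:  # sentinel: stop the worker
            events.task_done()
            break
        try:
            if classify(event["content"]) >= threshold:
                delete_event(event["id"])
        finally:
            events.task_done()

def start_worker(events, classify, delete_event, threshold=0.9):
    """Run the worker in a background thread so slow scoring never blocks writes."""
    worker = threading.Thread(
        target=run_worker,
        args=(events, classify, delete_event, threshold),
        daemon=True,
    )
    worker.start()
    return worker
```

The relay side would only need to push accepted events onto the `queue.Queue`; a classifier that takes a second or more per event then just delays deletion instead of delaying the write.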

Yes, I think you and nostr:npub1qlkwmzmrhzpuak7c2g9akvcrh7wzkd7zc7fpefw9najwpau662nqealf5y can easily handle that problem 🙂