AI companies are apparently scraping nostr now (with an incredibly inefficient way).

ClaudeBot accessed nostr.at 12 MILLION times since december 16th.

(which is about a quarter of all requests - 48million). I'll do a deeper analysis later

Reply to this note

Please Login to reply.

Discussion

not surprising, everything is a REST interface to them

trying to find non-ai generated material in their quest for the holy grail of AGI

which they are never going to find

plus most of the content here is going to be blotted out by their programmed bias "fact checking" and whatnot

There's a mathematical formula no AI engineer wants to come to grips with. I think I told nostr:nprofile1qyfhwumn8ghj7ctvvahjuat50phjummwv5q32amnwvaz7tm9v3jkutnwdaehgu3wd3skueqqyzu7we2xhgry2mknq8v7227yn7jguu9xhu3g90n6rtnjj3mpyq3ackdvvhl about that very formula (though I may have gotten it wrong).

something to do with the energy cost of processing compared to how efficiently grey matter does it? or maybe the massive number of neurons and synapses in a human brain compared to even all the GPUs in the world?

I found it.

It's this: L(N,D) = [(Nc/N)DN/OD + D6/D]OD

Who is going to pay for the relays? Maybe that's an (not so fortunate) answer.

nevent1qqspa76sg505t5ma9vlxyvxvc6yyxrr68dnc00ykgm4lh6g33hgdyzgjl2444

Let's fill some fake accounts with gibberish data and let them untrain their models with it.

Oh don’t you worry, plenty of people are on top of this initiative already

Probably for the best that we don’t have a way to include alt text and user tags in images yet. 😬

The domain is forever fucked now.

They won't stop; I speak from experience.

sure you can have some kind of bot protection on the relay api level, no?

It seems reality confirm my note (the original thread was about relay policy) :

nevent1qvzqqqqqqypzqej7xe8nug4h8v3j48esuddpf87gjdvsz5y0ytyc2vwpf5trzzheqyghwumn8ghj7vf5xqhxvdm69e5k7tcpzpmhxue69uhkztnwdaejumr0dshsz9mhwden5te0vf5hgcm0d9hx2u3wwdhkx6tpdshsqgp6j2q7elp5lyas4n4dwhfh26ktjwe06ddnmf0ww8w7tjzmpmud9sejczll

Not sure if you are saying I'm feeding the AI or that privacy policies are useless?

You talk about companies scraping #nostr

My point is that i was saying it would happen.

And yes the original topic was about relay policies that can be claimed but not verified, so it would be useless.

I had talk about AI like a bad example of what "could" happen, and you give an news of what is happening.

Thanks for your news

Haa….urmmmm maybe that was partly my fault 😂

Are you sure it's them and not one of their users hooking up to an MCP toolbox?

Sure is a strong word when going purely on hse agent but it is ClaudeBot which is their craqling agent UA (https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler)

Interesting!