AI companies are apparently scraping nostr now (with an incredibly inefficient way).
ClaudeBot accessed nostr.at 12 MILLION times since december 16th.
(which is about a quarter of all requests - 48million). I'll do a deeper analysis later
AI companies are apparently scraping nostr now (with an incredibly inefficient way).
ClaudeBot accessed nostr.at 12 MILLION times since december 16th.
(which is about a quarter of all requests - 48million). I'll do a deeper analysis later
not surprising, everything is a REST interface to them
trying to find non-ai generated material in their quest for the holy grail of AGI
which they are never going to find
plus most of the content here is going to be blotted out by their programmed bias "fact checking" and whatnot
There's a mathematical formula no AI engineer wants to come to grips with. I think I told nostr:nprofile1qyfhwumn8ghj7ctvvahjuat50phjummwv5q32amnwvaz7tm9v3jkutnwdaehgu3wd3skueqqyzu7we2xhgry2mknq8v7227yn7jguu9xhu3g90n6rtnjj3mpyq3ackdvvhl about that very formula (though I may have gotten it wrong).
Who is going to pay for the relays? Maybe that's an (not so fortunate) answer.
nevent1qqspa76sg505t5ma9vlxyvxvc6yyxrr68dnc00ykgm4lh6g33hgdyzgjl2444
Let's fill some fake accounts with gibberish data and let them untrain their models with it.
Oh don’t you worry, plenty of people are on top of this initiative already
Probably for the best that we don’t have a way to include alt text and user tags in images yet. 😬
The domain is forever fucked now.
They won't stop; I speak from experience.
sure you can have some kind of bot protection on the relay api level, no?
It seems reality confirm my note (the original thread was about relay policy) :
nevent1qvzqqqqqqypzqej7xe8nug4h8v3j48esuddpf87gjdvsz5y0ytyc2vwpf5trzzheqyghwumn8ghj7vf5xqhxvdm69e5k7tcpzpmhxue69uhkztnwdaejumr0dshsz9mhwden5te0vf5hgcm0d9hx2u3wwdhkx6tpdshsqgp6j2q7elp5lyas4n4dwhfh26ktjwe06ddnmf0ww8w7tjzmpmud9sejczll
Not sure if you are saying I'm feeding the AI or that privacy policies are useless?
You talk about companies scraping #nostr
My point is that i was saying it would happen.
And yes the original topic was about relay policies that can be claimed but not verified, so it would be useless.
I had talk about AI like a bad example of what "could" happen, and you give an news of what is happening.
Thanks for your news
The bots are coming 🤖 ... If only we made it easy for them to pay us ;) nostr:nprofile1qyxhwumn8ghj7mn0wvhxcmmvqy28wumn8ghj7un9d3shjtnyv9kh2uewd9hsqg9lc6hcy3xu9pv7lh7saqdx5705acu4h3u2eveq9dhjs7su5w38kvgy3cya
Empresas de iA já estão coletando dados do nostr
Haa….urmmmm maybe that was partly my fault 😂
Are you sure it's them and not one of their users hooking up to an MCP toolbox?
Sure is a strong word when going purely on hse agent but it is ClaudeBot which is their craqling agent UA (https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler)
Interesting!