Nostr Web Client

aljaz 8mo ago

AI companies are apparently scraping nostr now (in an incredibly inefficient way).

ClaudeBot accessed nostr.at 12 MILLION times since December 16th.

(which is about a quarter of all 48 million requests). I'll do a deeper analysis later.

Discussion

Gigi 8mo ago

not surprising, everything is a REST interface to them

ᴛʜᴇ ᴅᴇᴀᴛʜ ᴏꜰ ᴍʟᴇᴋᴜ 8mo ago

trying to find non-ai generated material in their quest for the holy grail of AGI

which they are never going to find

plus most of the content here is going to be blotted out by their programmed bias "fact checking" and whatnot

Neigsendoig Cocules 8mo ago

There's a mathematical formula no AI engineer wants to come to grips with. I think I told nostr:nprofile1qyfhwumn8ghj7ctvvahjuat50phjummwv5q32amnwvaz7tm9v3jkutnwdaehgu3wd3skueqqyzu7we2xhgry2mknq8v7227yn7jguu9xhu3g90n6rtnjj3mpyq3ackdvvhl about that very formula (though I may have gotten it wrong).

ᴛʜᴇ ᴅᴇᴀᴛʜ ᴏꜰ ᴍʟᴇᴋᴜ 8mo ago

something to do with the energy cost of processing compared to how efficiently grey matter does it? or maybe the massive number of neurons and synapses in a human brain compared to even all the GPUs in the world?

Neigsendoig Cocules 8mo ago

I found it.

It's this: $L(N, D) = \left[\left(\frac{N_c}{N}\right)^{\alpha_N/\alpha_D} + \frac{D_c}{D}\right]^{\alpha_D}$

Victor Stabile 8mo ago

Who is going to pay for the relays? Maybe that's a (not so fortunate) answer.

nevent1qqspa76sg505t5ma9vlxyvxvc6yyxrr68dnc00ykgm4lh6g33hgdyzgjl2444

Danz# 8mo ago

Let's fill some fake accounts with gibberish data and let them untrain their models with it.

Eric FJ 🪬⚡️ 8mo ago

Oh don’t you worry, plenty of people are on top of this initiative already

The Daniel 🖖 8mo ago

Probably for the best that we don’t have a way to include alt text and user tags in images yet. 😬

Abstract Equilibrium 8mo ago

The domain is forever fucked now.

They won't stop; I speak from experience.

matevz 8mo ago

surely you can have some kind of bot protection at the relay API level, no?
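One common form of the bot protection asked about here is a per-client rate limit. The following is only a minimal sketch of a token bucket, not how any particular relay does it: the `TokenBucket` class, its parameters, and the idea of keeping one bucket per client IP are all assumptions made for illustration.

```python
import time


class TokenBucket:
    """Hypothetical per-client limiter: `rate` tokens refill per second, up to `capacity`."""

    def __init__(self, rate, capacity, now=None):
        self.rate = float(rate)
        self.capacity = float(capacity)
        self.tokens = float(capacity)  # start with a full bucket
        self.updated = time.monotonic() if now is None else now

    def allow(self, now=None):
        """Return True and spend one token if the request may proceed."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated)
        # Refill in proportion to elapsed time, never above capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


# Assumed usage: one bucket per client address (the address is a documentation example).
buckets = {}


def allow_request(client_ip, now=None):
    bucket = buckets.setdefault(client_ip, TokenBucket(rate=1.0, capacity=2, now=now))
    return bucket.allow(now=now)
```

A limiter like this only slows a crawler down. It does not identify one, and a scraper that spreads its requests across many addresses would pass through it.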

btcttombola 8mo ago

It seems reality confirms my note (the original thread was about relay policy):

nevent1qvzqqqqqqypzqej7xe8nug4h8v3j48esuddpf87gjdvsz5y0ytyc2vwpf5trzzheqyghwumn8ghj7vf5xqhxvdm69e5k7tcpzpmhxue69uhkztnwdaejumr0dshsz9mhwden5te0vf5hgcm0d9hx2u3wwdhkx6tpdshsqgp6j2q7elp5lyas4n4dwhfh26ktjwe06ddnmf0ww8w7tjzmpmud9sejczll

aljaz 8mo ago

Not sure if you are saying I'm feeding the AI or that privacy policies are useless?

btcttombola 8mo ago

You are talking about companies scraping #nostr

My point is that I was saying it would happen.

And yes, the original topic was about relay policies that can be claimed but not verified, so they would be useless.

I had brought up AI as a bad example of what "could" happen, and you gave news of what is happening.

Thanks for your news

Max 8mo ago

The bots are coming 🤖 ... If only we made it easy for them to pay us ;) nostr:nprofile1qyxhwumn8ghj7mn0wvhxcmmvqy28wumn8ghj7un9d3shjtnyv9kh2uewd9hsqg9lc6hcy3xu9pv7lh7saqdx5705acu4h3u2eveq9dhjs7su5w38kvgy3cya

nostr:nevent1qvzqqqqqqypzpml96ysd7rxzjra8fpe8ldz6cjru4tf5d48j9yatq60g7q0u2xvpqy88wumn8ghj7mn0wvhxcmmv9uq36amnwvaz7tmwdaehgu3wvf5hgcm0d9hx2u3wwdhkx6tpdshsqgq7ldgy28696d7jk0nzxrxvdzzrp3arkeu8hjtyd6lmaygcm5xjpydgk269

novo 8mo ago

AI companies are already collecting data from nostr

nostr:nevent1qqspa76sg505t5ma9vlxyvxvc6yyxrr68dnc00ykgm4lh6g33hgdyzgpr9mhxue69uhhqun9d45h2mfwwpexjmtpdshxuet59upzpml96ysd7rxzjra8fpe8ldz6cjru4tf5d48j9yatq60g7q0u2xvpqvzqqqqqqyvwr3c0

BITKARROT 8mo ago

Haa….urmmmm maybe that was partly my fault 😂

CASCDR 8mo ago

Are you sure it's them and not one of their users hooking up to an MCP toolbox?

aljaz 8mo ago

Sure is a strong word when going purely on user agent, but it is ClaudeBot, which is their crawling agent UA (https://support.anthropic.com/en/articles/8896518-does-anthropic-crawl-data-from-the-web-and-how-can-site-owners-block-the-crawler)
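Going purely on user agent, a share like the one quoted at the top of the thread can be computed with a simple substring match. This is a rough sketch and not the analysis actually run on nostr.at: the `user_agent_share` helper and the sample strings are made up for illustration, and a user agent header can be spoofed, so the count shows what clients claim to be and nothing more.

```python
def user_agent_share(user_agents, needle="ClaudeBot"):
    """Return (matching, total, fraction) for UA strings that contain `needle`.

    The match is a case-insensitive substring test, so it trusts the
    self-reported header and cannot detect spoofing.
    """
    total = 0
    matching = 0
    for ua in user_agents:
        total += 1
        if needle.lower() in ua.lower():
            matching += 1
    fraction = matching / total if total else 0.0
    return matching, total, fraction


# Illustrative, made-up user agent strings (not real log data).
sample = [
    "ExampleCrawler/1.0 (compatible; ClaudeBot/1.0)",
    "ExampleBrowser/1.0",
    "ExampleBrowser/2.0",
    "ExampleFeedReader/0.3",
]

matching, total, fraction = user_agent_share(sample)
print(f"{matching} of {total} requests ({fraction:.0%}) matched")
```

With the thread's own numbers, 12 million matching requests out of 48 million total gives the same fraction, 0.25.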

CASCDR 8mo ago

Interesting!