Replying to rabble

What do people think of this? nostr:note13k0wh965nntau3jdx8d3ls96uus2zk778z4w6dyg9r8ptt94e6hsj94y3r It seems that Mastodon is designed to share user posts, including private DMs, with anyone who asks for them. Someone with an AI background pulled in a bunch of posts and analyzed them, including labeling the content.

These folks, Maven, followed the ActivityPub spec and the terms of service. They downloaded publicly accessible data using Mastodon servers and services as designed. They then analyzed that data and ran an algorithm to add labels, similar to how every fediverse server does. The difference here is that Maven used machine learning to add some labels, whereas others add labels such as timestamps when the local server downloads the data without using newer machine learning tech.

Bing, DuckDuckGo, and Google also do this; they crawl the fediverse, use AI and machine learning to label content, and display it in different contexts.

The tags that Maven adds are pretty innocent. They are just adding hashtag-like labels for discoverability.

Furthermore, many people are upset that Maven is leaking people's DMs. This is like living in a house where you refuse to have a front door or curtains on your windows and then getting very upset when somebody wanders in and sits down in your living room or looks in from across the street. The fediverse, by design, has no privacy. DMs are public! It says right there in Mastodon that these aren’t private. Nor are Bluesky's DMs, by the way. There is no end-to-end encryption in the fediverse yet. Evan Prodromou is actually working on this, likely adapting the MLS standard, which is great but doesn’t exist yet.

So my question is this: Why does the fediverse rely on unwritten and undocumented norms that are not mentioned in either the specs or terms of service? And why are people constantly surprised when others don't follow these hidden social conventions?

Once Nostr's data is large enough, anyone can use it. At present, there is not enough commercial value in it for them to explore, but they will come sooner or later. Fortunately, privacy protection for Nostr's DMs keeps getting stronger.

Discussion

I'm hoping Internet crawlers start indexing the wiki pages and articles/blogs, so that more people are incentivized to use them, knowing their content will be reachable through search engines.

I have discussed this issue before. Why can't Google's search engine find the content on Nostr.band? If the content on Nostr could be picked up by search engines, it would be very beneficial for promoting Nostr. #[4]

Not being searchable via Google is an advantage.

Nah, that's a bug, not a feature. I'm not putting this much effort into my writing to have it seen by 3 npubs in Paderborn.

Who submits to the Google crawler? Or how does the Google spider crawl relay content?

It doesn't crawl relays, but it can crawl webpages that display notes.

But anyone can set up a relay, aggregate notes from other (readable) relays, and publish them to webpages that get indexed.

This function is currently done by very few. Even with 35+ active NIP-50 relays, most are NOT used by apps; only the top 3 to 5 are used in searches, as per the latest findings.

Only need one person to do it. Someone could just run an archive where they aggregate and store everything forever.
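
For anyone wondering what that aggregation step could look like, here is a rough sketch, not anyone's actual archive. It assumes the notes have already been fetched from each relay as plain dicts with the NIP-01 fields `id`, `kind`, `created_at`, and `content`; the function names `merge_events` and `render_page` are made up for illustration, and fetching from relays is left out entirely.

```python
import html
from datetime import datetime, timezone


def merge_events(*relay_batches):
    """Merge event lists from several relays, dropping duplicates by event id."""
    seen = {}
    for batch in relay_batches:
        for event in batch:
            # The same note is often stored on many relays; keep the first copy.
            seen.setdefault(event["id"], event)
    # Newest first, like a typical feed page.
    return sorted(seen.values(), key=lambda e: e["created_at"], reverse=True)


def render_page(events, title="Aggregated notes"):
    """Render kind-1 text notes as a plain HTML page a crawler can read."""
    items = []
    for event in events:
        if event.get("kind") != 1:
            continue  # skip reactions, metadata, and other non-text kinds
        when = datetime.fromtimestamp(event["created_at"], tz=timezone.utc)
        event_id = html.escape(event["id"])
        content = html.escape(event["content"])  # never emit raw note text
        items.append(
            f'<article id="{event_id}">'
            f"<time>{when.isoformat()}</time>"
            f"<p>{content}</p>"
            "</article>"
        )
    body = "\n".join(items)
    head = f"<head><title>{html.escape(title)}</title></head>"
    return f"<!doctype html>\n<html>{head}\n<body>\n{body}\n</body></html>"
```

Writing the returned string to a file on any web server gives a search engine something it can index, which is the point made above: the crawler never has to talk to a relay.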