If you believed this actually worked, you would be caching the entire DNS in your hosts.txt file, the original way people did name resolution until they realized they needed a structured network... but you know it doesn't work, so you don't.
This is a misunderstanding about DHTs.
1. You don't need to run a node; you can ask for data in client mode. Just go to the Pkarr repo and run the examples: start the process, make the query, close the process. In and out in less than the time it takes to open an average web page (see the sketch after this list).
2. No one said don't use relays to cache data from many people's queries, and no one said don't gossip and cache long-term on top. What we are saying is: we need a source of truth that scales, one we can fall back to when data is not available in the private cache you got from your friend on a USB stick.
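To make point (1) concrete, here is a minimal sketch of that in-and-out flow in Rust. The DhtClient type and its methods are stand-ins I made up for illustration, not the real Pkarr API (the repo's examples show the actual one); the point is only the shape: bootstrap, one lookup, exit, no long-running node.

```rust
// Sketch of client mode: start, query, exit. DhtClient is a stand-in,
// NOT the real Pkarr API; no routing table is maintained between runs.

struct DhtClient;

impl DhtClient {
    /// Bootstrap from well-known routers; just enough state for a lookup.
    fn bootstrap(_routers: &[&str]) -> Self {
        DhtClient
    }

    /// Walk the DHT towards the key; a real client does ~log(n) UDP
    /// round-trips here and returns the signed packet, if found.
    fn resolve(&self, _public_key: &str) -> Option<Vec<u8>> {
        None // stubbed out for the sketch
    }
}

fn main() {
    let client = DhtClient::bootstrap(&["router.bittorrent.com:6881"]);
    // z-base-32 public key of the record we want (placeholder value).
    match client.resolve("yj47pezutnpw9pyudeew8c1f1gkdo8eso144t15sqduhf5m6xdiy") {
        Some(packet) => println!("resolved {} bytes", packet.len()),
        None => println!("not found"),
    }
    // The process exits here: in and out, faster than an average page load.
}
```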
First: DNS is currently not used that way by most people, let alone bots.
Second: it doesn't matter what you are willing to download; the question is, how on earth am I supposed to know that you have the data I am looking for?
Third: if your solution is "always download and hoard everything that anyone publishes", then I personally, and many others, will start spamming the hell out of you until most nodes give up and churn, just to troll or to use you as a free storage system.
And that is before getting to the freshness of data.
My point is, claiming that you can have full replication of open data with redundancy is just false. You either have to sacrifice redundancy (centralisation), sacrifice full replication, or sacrifice openness (make it paid, like Bitcoin).
And as soon as full replication fails, you are back to: who has the data I am looking for? I have a URL; where is the server? The only way to answer that reliably is structured networks, not gossip.
I mean, Nostr is very small and very centralised (most people read and write from and to a small set of servers), and STILL full replication is not the case, so unresolvable links are common.
And this is the best case (social media), where lazy gossip is natural; try doing this for cold queries like curl.
What do you see in the future that makes this better not worse?
Lightning is literally a network of mutually trusting nodes, connected directly or indirectly through payment channels.
It is not a gossip network trying to store and retrieve an ever-growing set of data reliably and performantly.
I want to remind you that I suggested the RBSR paper to Hoytech and helped with Negentropy, but it doesn't work for what you want to use it for. It works for passively replicating data you are interested in; it doesn't magically make all the data available to everyone. And if the data is not available to everyone, then you again have to deal with the question: how do you find who has the subset you are looking for in less than 500ms?
The answers we know of are:
1. A structured network (DNS/DHT)
2. A small set of servers (like ten popular relays)
If you are happy with (2), fine, but then you have to explain what incentive they have to serve the entire web (it costs a lot).
But what you can't do is claim either that you can have full replication of an ever-growing dataset across thousands or millions of nodes, or that you can have partial replication across thousands of unstructured nodes with fast queries.
At least not without extraordinary proof. So maybe start building and let's poke at it and see what happens. Or run a simulation or something.
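On the Negentropy point above, here is a toy sketch of what range-based set reconciliation buys you (not Negentropy's actual wire format; the fingerprint is a stand-in fold, and the sets are sorted, distinct toy hashes). It diffs two sets that both peers already track very cheaply; what it never answers is which peer, out of thousands, holds the subset you are missing.

```rust
// Toy range-based set reconciliation: compare range fingerprints and
// recurse only into ranges that differ (a real implementation uses an
// incremental cryptographic fingerprint; this is a stand-in fold).

fn fingerprint(items: &[u64]) -> u64 {
    items.iter().fold(0u64, |acc, &x| {
        acc.wrapping_mul(0x0000_0100_0000_01B3)
            .wrapping_add(x ^ 0x9E37_79B9_7F4A_7C15)
    })
}

fn reconcile(a: &[u64], b: &[u64], diffs: &mut Vec<u64>) {
    if fingerprint(a) == fingerprint(b) {
        return; // matching ranges cost one fingerprint, not their contents
    }
    if a.len() + b.len() <= 4 {
        // small range: exchange the raw items and diff them
        for &x in a {
            if !b.contains(&x) {
                diffs.push(x);
            }
        }
        for &x in b {
            if !a.contains(&x) {
                diffs.push(x);
            }
        }
        return;
    }
    // split both ranges at the same value boundary and recurse
    let src = if a.len() >= b.len() { a } else { b };
    let pivot = src[src.len() / 2];
    let (a_lo, a_hi) = a.split_at(a.partition_point(|&x| x < pivot));
    let (b_lo, b_hi) = b.split_at(b.partition_point(|&x| x < pivot));
    reconcile(a_lo, b_lo, diffs);
    reconcile(a_hi, b_hi, diffs);
}

fn main() {
    let alice: Vec<u64> = (0..1000u64).map(|i| i * 2).collect();
    // Bob's copy drifts at every 50th item.
    let bob: Vec<u64> = (0..1000u64)
        .map(|i| i * 2 + (i % 50 == 0) as u64)
        .collect();
    let mut diffs = Vec::new();
    reconcile(&alice, &bob, &mut diffs);
    println!("{} differing items found", diffs.len());
}
```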
Bittorrent isn't at all what you are describing. When peers are gossiping, they are not sharing an ever-growing dataset; it is a static file, and if it were a changing dataset they would necessarily be discarding portions of it, because they don't have infinite storage. In fact, most seeders immediately delete the static data as soon as they are done with it.
You can try to slow down the data growth as much as you want, but the best you can do is make it as expensive as a Bitcoin transaction and make an update take half an hour of hashcash. Ignoring the awful UX of that, the data still grows, forever.
How many nodes do you expect to dedicate 100 gigabytes to that? Definitely not millions or thousands... definitely not as many as Bitcoin nodes, since it is not as profitable or as necessary to store the full, ever-growing set.
And then these few nodes have to serve the entire Internet, and they become easier to control or attack because they are few.
You will never be able to have the full dataset; you will have to discard data. And the moment you start doing that, you will realise that readers need to find out who has the parts they need, and they can't, because there is no structure. So they will have to ask everyone, and that is exactly what a DHT is meant to make scalable: asking log20(n) nodes instead of n nodes.
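To make the scaling point concrete, here is a toy simulation (not Mainline's actual wire protocol; all names and numbers are made up for the demo) of the one mechanic that matters: each queried node can point you at some peer strictly closer to the target by XOR distance, so the number of nodes you touch grows like log(n), not n.

```rust
// Toy simulation of a structured lookup. The only mechanic kept is the
// XOR metric: every queried node can return some peer strictly closer
// to the target, because that is what its routing table covers. Count
// how many nodes we ask before landing on the closest one.

fn xor_distance(a: u64, b: u64) -> u64 {
    a ^ b
}

// Tiny deterministic PRNG (xorshift64) so the demo needs no crates.
fn next_rand(s: &mut u64) -> u64 {
    *s ^= *s << 13;
    *s ^= *s >> 7;
    *s ^= *s << 17;
    *s
}

fn main() {
    let n: usize = 100_000;
    let mut seed: u64 = 0x9E37_79B9_7F4A_7C15;
    let ids: Vec<u64> = (0..n).map(|_| next_rand(&mut seed)).collect();

    let target: u64 = 0xDEAD_BEEF_CAFE_F00D;
    let closest = *ids
        .iter()
        .min_by_key(|&&id| xor_distance(id, target))
        .unwrap();

    let mut current = ids[0];
    let mut queries = 0;
    while current != closest {
        queries += 1;
        let d = xor_distance(current, target);
        // Model the routing-table guarantee: the queried node knows an
        // arbitrary sample of peers closer to the target than itself.
        let closer: Vec<u64> = ids
            .iter()
            .copied()
            .filter(|&id| xor_distance(id, target) < d)
            .collect();
        current = closer[(next_rand(&mut seed) as usize) % closer.len()];
    }
    println!(
        "asked {} nodes out of {} (ln(n) ≈ {:.1})",
        queries,
        n,
        (n as f64).ln()
    );
}
```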
Anyways, just build it and see if you can survive spam without degenerating into a handful of nodes, like an abandoned Bittorrent infohash.
Please tag me when you build the system that you think is simpler than a DHT, so I can poke at it. For now, what I have is DHTs that work in theory and in practice. I claim that gossip-based persistent replication will always degenerate into centralisation, and I have a long list of historical case studies; nothing would make me more excited than trying to break another attempt, and failing.
Bittorrent had 15 years to develop a system where the gossip part can't be spammed/abused and where sharing data is fair, and nothing came up better than good old trackers.
My point is, the only system we know of where gossip works at scale for persistent data is Bitcoin, and that is because you have to pay fees to gatekeepers who themselves have to pay in a verifiably scarce resource.
And it seems you are reinventing that by thinking in the direction of hashcash, where a proof of work entitles you to store your data.
The problem is, hashcash was already invented to solve the centralisation of email by countering spam, and it failed miserably, and it will fail again every time, I bet.
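For concreteness, this is roughly all a hashcash-style stamp is; a minimal sketch (assuming the sha2 crate; payload and difficulty are arbitrary demo values), not anyone's production scheme. The asymmetry is real, but the bar costs the same for a botnet with stolen cycles as for a phone on battery, which is why it could not stop email spam.

```rust
// Minimal hashcash-style stamp (dependency: sha2 = "0.10"): grind a
// nonce until the hash of (payload, nonce) clears a difficulty bar;
// verification is a single hash.

use sha2::{Digest, Sha256};

fn leading_zero_bits(hash: &[u8]) -> u32 {
    let mut bits = 0;
    for &byte in hash {
        if byte == 0 {
            bits += 8;
        } else {
            bits += byte.leading_zeros();
            break;
        }
    }
    bits
}

/// Publisher side: brute-force a nonce (expensive, by design).
fn mint(payload: &[u8], difficulty: u32) -> u64 {
    let mut nonce: u64 = 0;
    loop {
        let mut hasher = Sha256::new();
        hasher.update(payload);
        hasher.update(nonce.to_le_bytes());
        if leading_zero_bits(hasher.finalize().as_slice()) >= difficulty {
            return nonce;
        }
        nonce += 1;
    }
}

/// Storing-node side: one hash to check the stamp.
fn verify(payload: &[u8], nonce: u64, difficulty: u32) -> bool {
    let mut hasher = Sha256::new();
    hasher.update(payload);
    hasher.update(nonce.to_le_bytes());
    leading_zero_bits(hasher.finalize().as_slice()) >= difficulty
}

fn main() {
    let update = b"record update for key abc, seq 42";
    let difficulty = 16; // ~2^16 hashes to mint, one hash to verify
    let nonce = mint(update, difficulty);
    assert!(verify(update, nonce, difficulty));
    println!("minted stamp with nonce {nonce}");
}
```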
And the blocks are not even full, and everyone is using custodial LN wallets.
Massive lists are cheap to store and cheap to search through. And the "somehow" is a question of ownership: we know how to make collaborative lists managed by owners/moderators.
I don't think there is any evidence that any leadership could have done better. I certainly wouldn't have, and no one here would have; I am not sure who you have in mind that could have done a better job.
Got it, this is the usual gossip vs DHT question.
I think you are underestimating how vulnerable to spam this gossip network will be, and how expensive it will be.
There is an inverse relationship between the cost of nodes and their decentralisability, and I claim that in a system with no fees for creating new inputs, the only stable configuration is a consortium of small, mutually trusting servers, basically like email.
But of course if that is not convincing, we can always try things in practice.
I agreed with most of this until the end. The question is how often you are going to make a query for something you haven't cached before. I call this the cold lookup.
And the answer to that is: the vast majority of the time, for two reasons:
1. There are more URLs on the Web and more endpoints than you can cache. If you want to replace ICANN DNS, and not just the contact list in a social app, then you should expect that most URLs are either being seen for the first time or have been evicted from your limited cache.
2. You want to support stateless clients, so caching at relays is much more reliable than expecting every client to start with its own equivalent of a hosts.txt of the entire web.
Again, we already know what works for the general purpose, and it is not a local hosts.txt.
But I am not saying you shouldn't use local caches if you can; you should, and Pkarr encourages that. It is just one layer of caching, though, and without falling back to the DHT, the UX really doesn't work. You can't take a step backwards from the status quo, especially when you are already asking users to do the hard thing of managing their own keys; you can't add on top of that URLs that have a 10% chance of working, and otherwise "it depends".
Is replacing DNS (a structured network) with manual gossip a step forward or backward?
The Mainline DHT has millions of nodes, so at any given moment there is capacity for billions of small packets.
When you introduce a caching layer (relays) on top, you get the structured (yet flatter) hierarchy of DNS, where the DHT is the alternative to root servers.
Combine that with the natural semantics of DNS, especially TTL, and these relays now know when to check the DHT again.
Then the DHT only needs to contain your data often enough for relays to pick it up.
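Here is a sketch of that relay layer, with a stand-in Dht type rather than any real API, since the shape is the point: serve from cache while the record's own TTL is fresh, and only fall back to the DHT, root-server style, when it lapses.

```rust
// Sketch of a relay as a caching layer over the DHT. The Dht type is a
// stand-in, not a real API. The signed packet carries its own DNS TTL,
// so the relay knows exactly when a cached answer is still good and
// when to go back to the source of truth.

use std::collections::HashMap;
use std::time::{Duration, Instant};

struct Record {
    payload: Vec<u8>,
    ttl: Duration, // taken from the signed DNS packet itself
    fetched_at: Instant,
}

struct Dht; // stand-in for the structured network underneath

impl Dht {
    fn get(&self, _public_key: &str) -> Option<(Vec<u8>, Duration)> {
        // a real lookup is ~log(n) UDP round-trips returning packet + TTL
        Some((b"A 203.0.113.7".to_vec(), Duration::from_secs(300)))
    }
}

struct Relay {
    cache: HashMap<String, Record>,
    dht: Dht,
}

impl Relay {
    fn resolve(&mut self, public_key: &str) -> Option<Vec<u8>> {
        if let Some(rec) = self.cache.get(public_key) {
            if rec.fetched_at.elapsed() < rec.ttl {
                return Some(rec.payload.clone()); // fresh: no DHT traffic
            }
        }
        // unseen or stale: this is the "root server" fallback
        let (payload, ttl) = self.dht.get(public_key)?;
        self.cache.insert(
            public_key.to_string(),
            Record {
                payload: payload.clone(),
                ttl,
                fetched_at: Instant::now(),
            },
        );
        Some(payload)
    }
}

fn main() {
    let mut relay = Relay {
        cache: HashMap::new(),
        dht: Dht,
    };
    let key = "some-z-base-32-public-key"; // placeholder
    relay.resolve(key); // cache miss: hits the DHT
    relay.resolve(key); // cache hit: served locally until the TTL lapses
}
```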
It is not about efficiency or correctness; it is about the UX, DX, and reliability of the system... remember that DNS started with manual gossip of hosts.txt contents. That didn't scale; why try it again?
Worst case scenario, that just makes centralised mining pools even more profitable versus miners that are only getting transactions from the mempool.
Best case scenario, another gossip network emerges, with a codebase simple enough that miners don't mind running it in parallel to Bitcoin Core to hear about non-standard transactions. Even if they don't trust it, they can firewall it one way or another.
If people keep pretending the mempool is a gatekeeper, they will start submitting transactions to mining pools directly, like popular Nostr relays... is that how you keep Bitcoin mining permissionless and decentralised?
Bluesky's metrics are steadily declining, and that is the network with the best chance of going mainstream.
It is time to stop caring about global feeds, make our small-worlds web better and more sovereign, and be happy with that.
You can't fight physics. All centralised social media will have to deal with this, and I don't think you can make non-centralised social media, unless you redefine social media as: chat apps that feel like Twitter but have the same reach as a Telegram group.

OP_RISCV
So Bluesky has just banned some guy, in Turkey only, solely because the Turkish government asked.

(from https://x.com/aliskorkut/status/1912191854939943362, this is the profile: https://bsky.app/profile/carekavga.bsky.social)
This can and will (if it is worth it at all) happen to any Pubky indexers or Nostr relays that capture the majority of eyeballs. And when they comply, because they must, it will be effectively the same as shadow banning.
Play search engine games, win censorship prizes.
If you don't want to deal with censorship, don't get big, and don't work on problems that require big servers to solve them; or build a blockchain and make each post cost $20.
There is sincerely no third option.
You literally posted him lying by omission: he is pretending that people mirroring the centralised directory means anything different from just a TLD. That is a LIE. If you sit down and analyse it, you should be able to see why this key rotation and DID business is just a circus without a blockchain like Bitcoin.
But the real issue here is that no matter what I say, it won't matter, because everyone has decided to believe this bullshit no matter what, and to pretend that the nice people, because they are nice, achieved the impossible.
Ok fine.
I have to admit, every time I see a screenshot or a link from Mastodon, I get envious of the quality of the people and conversations there.
This could very well be selection bias, but I might cave in and create a Mastodon account.
I just wish Mastodon used Pkarr, so I would neither have to set up a server nor worry about which server I sign up to first.
We looked at Matrix, but Nostr events and relays are about as complicated as we want things to get, and this is also more of a Kind1 play. This, https://bettermode.com/, is an example of one we were asked to pitch against the other week.
Our pitch is: if you're going to be brave, pass over the other guy, and take things in-house, then you're going to want the most dead-simple thing possible. Nostr is pretty dead simple.
The reason to stick with Nostr is getting grants from Nostr enthusiasts... but if you need that more than the flexibility to address your market, then you are doomed anyway, in my opinion.


