ᴛʜᴇ ᴅᴇᴀᴛʜ ᴏꜰ ᴍʟᴇᴋᴜ
ʙoarᴅ cerᴛɪꜰɪeᴅ ᴛecʜno-ᴘʜaɢe. mʏ mɪnᴅ ɪs ʜunɢrʏ, anᴅ ꜰeeᴅs on noveʟᴛʏ. ᴅo ʏou ʜave someᴛʜɪnɢ ᴛo sʜare ᴛʜaᴛ ɪ never ʜearᴅ? "𝔅𝔢 𝔠𝔞𝔯𝔢𝔣𝔲𝔩 𝔣𝔬𝔯 𝔫𝔬𝔱𝔥𝔦𝔫𝔤; 𝔟𝔲𝔱 𝔦𝔫 𝔢𝔳𝔢𝔯𝔶 𝔱𝔥𝔦𝔫𝔤 𝔟𝔶 𝔭𝔯𝔞𝔶𝔢𝔯 𝔞𝔫𝔡 𝔰𝔲𝔭𝔭𝔩𝔦𝔠𝔞𝔱𝔦𝔬𝔫 𝔴𝔦𝔱𝔥 𝔱𝔥𝔞𝔫𝔨𝔰𝔤𝔦𝔳𝔦𝔫𝔤 𝔩𝔢𝔱 𝔶𝔬𝔲𝔯 𝔯𝔢𝔮𝔲𝔢𝔰𝔱𝔰 𝔟𝔢 𝔨𝔫𝔬𝔴𝔫 𝔲𝔫𝔱𝔬 𝔊𝔬𝔡. 𝔄𝔫𝔡 𝔱𝔥𝔢 𝔭𝔢𝔞𝔠𝔢 𝔬𝔣 𝔊𝔬𝔡, 𝔴𝔥𝔦𝔠𝔥 𝔭𝔞𝔰𝔰𝔢𝔱𝔥 𝔞𝔩𝔩 𝔲𝔫𝔡𝔢𝔯𝔰𝔱𝔞𝔫𝔡𝔦𝔫𝔤, 𝔰𝔥𝔞𝔩𝔩 𝔨𝔢𝔢𝔭 𝔶𝔬𝔲𝔯 𝔥𝔢𝔞𝔯𝔱𝔰 𝔞𝔫𝔡 𝔪𝔦𝔫𝔡𝔰 𝔱𝔥𝔯𝔬𝔲𝔤𝔥 ℭ𝔥𝔯𝔦𝔰𝔱 𝔍𝔢𝔰𝔲𝔰" - 𝔓𝔥𝔦𝔩𝔦𝔭𝔭𝔦𝔞𝔫𝔰 4:6-7 ᴛᴇʟᴇɢʀᴀᴍ: @mleku1 ᴍᴀᴛʀɪx: @mleku17:matrix.org ꜱɪᴍᴘʟᴇx: https://smp15.simplex.im/a#PPkiqGvf5kZ3AbFWBh3_tw1b_YgvnkSgDEc_-IuuRWc

decentralization and small businesses/orgs need the small stuff, but big business needs bigger stuff. the approaches are complementary. small, simple relays that are easy to set up and manage have a lower marginal staffing cost for a small operator, and for the overall picture they give a more resilient network and more decentralization/censorship resistance (taking down many small targets is a lot more work for an attacker, while a big system lets one attack do far more damage).

the way it's played out over the history of internet services has been very clear. the more you centralize, the more brittle the system becomes.

haha, yeah... this is the flaw in the bigger-is-better strategy when you can get a lot of mileage out of horizontal replication, especially when the database has GC to contain disk usage and the concomitant search cost of a larger database

yeah, i think as always hybrid is generally better than dedicated. smaller, simpler parts built with clean, simple interfaces are much easier to scale per use case

badger is better because it has split key/value tables. there's a lot less time wasted compacting every time values are written, and it's easier and faster to use the key table to store some of the data that has to be scanned a lot.
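
for example, badger exposes this split directly in its options: values bigger than a threshold go to the value log, smaller records stay in the LSM tree together with the keys. a minimal sketch, assuming badger v4's options API; the path and threshold are arbitrary examples:

```go
// a minimal sketch, assuming badger v4's options API: values bigger than the
// threshold are written to the value log, smaller records stay in the LSM
// tree together with the keys. the path and threshold are arbitrary examples.
package main

import (
	"log"

	badger "github.com/dgraph-io/badger/v4"
)

func main() {
	opts := badger.DefaultOptions("/tmp/example-db").
		WithValueThreshold(1024) // values under 1kb stay alongside the keys
	db, err := badger.Open(opts)
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
}
```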

for whatever stupid reason, nobody else in database development has realised the benefit of key/value table splitting, even though the tech has been around for 9 years already.

probably similar reasons why so many businesses are stuck with oracle

yeah, i've been thinking about how to do SSE properly. it seems to me the right way is to open one stream to handle all subscriptions: the moment the client wants a subscription open, one stream is opened, everything goes through it, and the event format (as in the SSE event) includes the subscription id it relates to.

this avoids the problem of the browser's per-origin connection limits (typically around 6 over HTTP/1.1). but even that many is plenty, and it's not even necessary, because you can multiplex them just by using subscription IDs, exactly like the websocket API does. it also simplifies keeping the subscription channel open, and it allows arbitrary other kinds of notifications to be pushed as well, ones we haven't thought of yet, beyond subscriptions to event queries driven by newly arrived events.

why do i think SSE should be used instead of a websocket? because it's less complex: basically just an HTTP body that is slowly written to. this pushes everything down to the TCP layer instead of adding a secondary websocket ping/pong layer on top. the client also knows when the SSE stream has disconnected and can start it up again, and the subscription processing on the relay side should keep a queue of events, so that when a subscription SSE dies it can push the cache of events that came in after the last one sent to the client (which also means there needs to be a top-level subscription connection identifier; IP address is not always going to work for multiple users behind one NAT).
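
here's a minimal sketch of what such a multiplexed stream could look like on the relay side, using Go's standard net/http. the handler, the pushed struct and the use of the SSE "event:" field for the subscription id are illustrative assumptions, not realy's actual HTTP API:

```go
// a minimal sketch (not realy's actual API) of a single SSE stream that
// multiplexes all of a client's subscriptions: each message names the
// subscription id in the "event:" field and carries the event JSON in "data:".
package relay

import (
	"fmt"
	"net/http"
)

// pushed carries an event destined for a particular subscription id.
type pushed struct {
	SubID string
	JSON  []byte
}

// subscriptionStream writes everything arriving on feed down one SSE response.
func subscriptionStream(feed <-chan pushed) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		flusher, ok := w.(http.Flusher)
		if !ok {
			http.Error(w, "streaming unsupported", http.StatusInternalServerError)
			return
		}
		w.Header().Set("Content-Type", "text/event-stream")
		w.Header().Set("Cache-Control", "no-cache")
		for {
			select {
			case <-r.Context().Done():
				return // client disconnected; it can reconnect and resume
			case p := <-feed:
				// the SSE event name is the subscription id, the data is the event itself
				fmt.Fprintf(w, "event: %s\ndata: %s\n\n", p.SubID, p.JSON)
				flusher.Flush()
			}
		}
	}
}
```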

also, just keep in mind that the websocket API by default creates subscription management state for every single socket that is opened, whereas if you do the queries primarily as http requests, this is slimmed down to a single subscription multiplexer, which makes it more memory efficient as well.

i don't think there is enough of a clear benefit in using websockets for everything; their only real utility is highly interactive connections. compared to multiplexing everything at the application layer, doing one-shot requests most of the time plus one subscription stream for pushing data back is a huge reduction in the state required to manage a single client connection

we need full text indexes first!

algorithms kinda depend on that

it's also a good use case for LLMs: take those fulltext search results and use their semantic juju to filter and sort them by relevance. there are ways to do this with simpler stuff, but feeding an LLM a set of results to rank by relevance is basically what they were born for. i've been using LLMs more in my programming work, and what they are really good at is sifting through a lot of data and picking out the relevant stuff. they are not so smart at writing code, because of the causality relations between processes and the ontology of the data. this is why there are these "agent" things, which basically use similar principles to language compiler state machines to define a procedure: the LLM creates plans, then evolves them as it acquires more input from the code.

anyway, yeah. we need full text indexes first. DVMs should not exist - the relay should have this facility built into it, and then a worker that takes results and sends them to an LLM to filter and sort them.
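
purely as a hypothetical sketch of that worker's shape (none of these types or names exist in realy): the fulltext index produces candidates, and a relevance ranker, an LLM or something simpler, orders them:

```go
// a hypothetical sketch of the worker described above: the relay's fulltext
// index returns candidate events, and a relevance ranker (an LLM behind this
// interface, or something simpler) sorts them before they are returned.
// none of these names exist in realy; they are illustrative only.
package search

import "context"

// Event is a minimal stand-in for a stored nostr event.
type Event struct {
	ID      string
	Content string
}

// Ranker is anything that can order candidates by relevance to a query.
type Ranker interface {
	Rank(ctx context.Context, query string, candidates []Event) ([]Event, error)
}

// FulltextIndex is a stand-in for the relay's built-in fulltext search.
type FulltextIndex interface {
	Search(ctx context.Context, query string, limit int) ([]Event, error)
}

// Query runs the fulltext search and then hands the results to the ranker.
func Query(ctx context.Context, idx FulltextIndex, r Ranker, q string) ([]Event, error) {
	candidates, err := idx.Search(ctx, q, 200) // over-fetch, then let the ranker trim
	if err != nil {
		return nil, err
	}
	return r.Rank(ctx, q, candidates)
}
```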

ah, just to explain how you do things with badger, because it differs from most other key/value stores due to the separation of key and value tables...

because writing values doesn't force any writes on the keys, the keys stay in order a lot more; generally, once compacted, forever compacted (compaction is playing the log out to push it into an easily iterated, pre-sorted array)

as a result, the best strategy with badger for storing any kind of information that won't change and needs to be scanned a lot is, very often, to put the values in the keys themselves. that's what i do for immutable stuff such as tombstones

the key table is also used for searching, as you would expect, but this is the reason why a database written on badger (used properly) is so much faster: it doesn't have to skip past the values when it's scanning, and you don't have to re-compact the keys when you change values. (and yes, it of course has versioning of keys. i don't use this feature, but in theory there is often some number of past versions of a value that can be accessed with a special accessor; more generally it makes the store more resilient, as you would expect)

so, yeah, the current arrangement for tombstones in realy is that the first (left, most significant) half of the event ID hash is the key. finding one is thus simple and fast: trim off the last half, prefix it with the tombstone key prefix, and you can just use the "get" function on the transaction instead of making a whole iterator. very neat, and very fast.
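
roughly like this, as a sketch; the prefix byte is a placeholder, not realy's actual table prefix:

```go
// a sketch of the tombstone lookup described above: the key is the tombstone
// prefix followed by the first (most significant) half of the 32-byte event
// ID, so a plain Get on the transaction is enough. the prefix byte here is
// an arbitrary placeholder, not realy's actual table prefix.
package store

import (
	"errors"

	badger "github.com/dgraph-io/badger/v4"
)

const tombstonePrefix = 0x74 // placeholder prefix byte for the tombstone table

// HasTombstone reports whether a tombstone exists for the given 32-byte event ID.
func HasTombstone(db *badger.DB, eventID [32]byte) (found bool, err error) {
	key := append([]byte{tombstonePrefix}, eventID[:16]...) // keep only the left half
	err = db.View(func(txn *badger.Txn) error {
		_, e := txn.Get(key)
		if errors.Is(e, badger.ErrKeyNotFound) {
			return nil // no tombstone, not an error
		}
		if e == nil {
			found = true
		}
		return e
	})
	return
}
```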

i also exploit these properties of badger key tables with the "return only the ID" functions, by creating an index that contains the whole ID after the event's serial number. that means the event itself doesn't have to be decoded for this case, which is a huge performance optimization as well.

yes, that full-ID index also contains a truncated hash of the pubkey, the kind number and the timestamp, so you can just pull all of the relevant keys for the result serials, filter out pubkeys and kinds, slice by range (if the index search didn't already do this), sort them in ascending or descending order of timestamp, and then return the event ids in that order.
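
to make that concrete, here's a sketch of such a composite key. the field widths and the prefix byte are guesses for illustration, not realy's actual layout:

```go
// a sketch of a composite index key along the lines described above: a table
// prefix, the event's serial number, then the full event ID, a truncated
// pubkey hash, the kind, and the timestamp, so queries can be answered from
// keys alone. widths and the prefix byte are illustrative guesses, not
// realy's actual layout.
package store

import "encoding/binary"

const idIndexPrefix = 0x69 // placeholder prefix byte for the full-ID index

// IDIndexKey packs the fields into one key. pubkeyHash is a truncated hash
// of the author's pubkey (8 bytes here, purely as an example).
func IDIndexKey(serial uint64, id [32]byte, pubkeyHash [8]byte, kind uint16, createdAt uint64) []byte {
	key := make([]byte, 0, 1+8+32+8+2+8)
	key = append(key, idIndexPrefix)
	key = binary.BigEndian.AppendUint64(key, serial)
	key = append(key, id[:]...)
	key = append(key, pubkeyHash[:]...)
	key = binary.BigEndian.AppendUint16(key, kind)
	key = binary.BigEndian.AppendUint64(key, createdAt)
	return key
}
```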

it's a much faster request to process, which means once the client has this list, it can pull the events for its initial display with a single query, plus a few extra for headroom, and fetch the rest as the display requires them, lazy-loading style.

this is the key reason why i make this index, and i designed it to be as svelte and sleek as possible for both bandwidth and rendering efficiency

idk how it works with other databases, but with badger you can use these "batch" streaming functions that automatically run with as many threads as you specify. a mark-and-sweep style GC pass on 18gb takes about 8 seconds on my machine, probably faster on current-gen NVMe and DDR5 memory

the GC can also do multiple types of collection at the same time, so you could set it to prune stuff based on access counters and first-seen timestamps that you keep, as well as snuffing old tombstones

mistakes were made. next round will be better.

baby carrots are so yummy. just scrub em hard with a brush or stainless steel scrubber instead of peeling them anyhow.

yeah, it also helps to make sure that there aren't any pebbles or big rocks in a carrot bed

they are harder to process for cooking when they are all deformed like this

yeah, realy has tombstones... and yeah it really should not store them, but only push them out to relays that are subscribed to them (which would be driven by the req of the replicas)

tombstones do eventually need to be cleaned up though. the tombstones in realy have a timestamp on them; i had it in mind to eventually make a GC that clears them out when they get too numerous, pruning the oldest ones that are unlikely to appear again (let's say, after 3 months or something)
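
a sketch of what that pruning pass could look like, reusing the key layout assumed in the earlier tombstone sketch and assuming the value holds an 8-byte unix timestamp (again, assumptions for illustration, not realy's actual format):

```go
// a sketch of the tombstone GC pass described above. it assumes the tombstone
// key is a prefix byte plus the left half of the event ID (as in the earlier
// sketch) and that the value is an 8-byte big-endian unix timestamp, both
// assumptions for illustration, not realy's actual layout.
package store

import (
	"encoding/binary"
	"time"

	badger "github.com/dgraph-io/badger/v4"
)

// PruneTombstones deletes tombstones older than maxAge (e.g. 90 days).
func PruneTombstones(db *badger.DB, maxAge time.Duration) error {
	cutoff := uint64(time.Now().Add(-maxAge).Unix())
	prefix := []byte{0x74} // placeholder tombstone table prefix, as in the earlier sketch
	var expired [][]byte

	// mark: scan the tombstone prefix and collect keys whose timestamp is too old
	err := db.View(func(txn *badger.Txn) error {
		opts := badger.DefaultIteratorOptions
		opts.Prefix = prefix
		it := txn.NewIterator(opts)
		defer it.Close()
		for it.Rewind(); it.Valid(); it.Next() {
			item := it.Item()
			err := item.Value(func(val []byte) error {
				if len(val) == 8 && binary.BigEndian.Uint64(val) < cutoff {
					expired = append(expired, item.KeyCopy(nil))
				}
				return nil
			})
			if err != nil {
				return err
			}
		}
		return nil
	})
	if err != nil {
		return err
	}

	// sweep: delete the expired tombstones in one write batch
	wb := db.NewWriteBatch()
	defer wb.Cancel()
	for _, key := range expired {
		if err := wb.Delete(key); err != nil {
			return err
		}
	}
	return wb.Flush()
}
```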

yeah, there is also a major missing thing: negations of all the filter fields. it would be so simple to add. you could then exclude events by id, pubkey, or whatever. every credible database query system has negation

"send me all that match this, except those that match that"

and yeah, having the option to just get the event IDs instead of the whole shebang.
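
as a purely hypothetical sketch, the filter could grow fields like these; nothing here is in NIP-01 or in realy, it just illustrates the "match this, except that" idea plus the IDs-only option:

```go
// a hypothetical sketch of what negation fields and an "IDs only" flag could
// look like on a nostr filter. none of these fields exist in NIP-01 or in
// realy today; they just illustrate the idea.
package filter

// Filter extends the familiar NIP-01 fields with negated counterparts.
type Filter struct {
	IDs, Authors []string
	Kinds        []int

	NotIDs, NotAuthors []string // exclude events matching these
	NotKinds           []int

	IDsOnly bool // return only event IDs, not whole events
}

// contains reports whether v is present in list.
func contains[T comparable](list []T, v T) bool {
	for _, x := range list {
		if x == v {
			return true
		}
	}
	return false
}

// Excludes reports whether the negations drop an event that otherwise matched.
func (f *Filter) Excludes(id, author string, kind int) bool {
	return contains(f.NotIDs, id) ||
		contains(f.NotAuthors, author) ||
		contains(f.NotKinds, kind)
}
```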

the real reason why all this stuff doesn't already exist is that nostr "envelopes" are such a shitty, hard-to-extend API that pretends not to be an API.

yeah, the function that gives event IDs based on the database sequence number also makes syncing so easy. don't tell semisol i said thanks for giving the idea tho. he already has too much air pressure between his ears
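
the sync loop that falls out of that is about this simple. the interface and method names here are placeholders, not realy's actual API:

```go
// a hypothetical sketch of syncing against an "event IDs since serial N"
// function like the one mentioned above. the Source interface and its method
// names are placeholders, not realy's actual API.
package syncer

import "context"

// Source exposes the minimum a replica needs: IDs in serial order, and the
// ability to fetch full events by ID.
type Source interface {
	IDsSince(ctx context.Context, serial uint64, limit int) (ids []string, lastSerial uint64, err error)
	FetchEvents(ctx context.Context, ids []string) error
}

// Sync pulls everything newer than the locally stored cursor, in pages,
// and returns the new cursor position.
func Sync(ctx context.Context, src Source, cursor uint64) (uint64, error) {
	for {
		ids, last, err := src.IDsSince(ctx, cursor, 1000)
		if err != nil {
			return cursor, err
		}
		if len(ids) == 0 {
			return cursor, nil // caught up
		}
		if err := src.FetchEvents(ctx, ids); err != nil {
			return cursor, err
		}
		cursor = last
	}
}
```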

i mean, not the rest. just the vibe coding clowns who aren't really doing anything but get all this money, and we are doing it mostly on our own dime, part time. haha. it's gonna be funny watching them try to pretend it doesn't exist.

haha, yeah, a benchmark would be pretty cool, especially if it can be pointed at any standard nip-01 relay for comparison

quality is subjective, and can't be discovered until after it's seen

unless you put some limit on what can be STORED on a relay how do you evaluate what anyone wants to see, and then ... you know.. pay someone to store it?

i say, no. you have to charge to store it. first. other relays can decide to copy that data, and it can be freely accessed from the first relay, so, whatever. but i am vehemently against popularity being any kind of means of deciding the cost of storage.

why? because storage has a fucking cost.

you make popularity a premium that means no price to post?

then you, normal, unpopular person, are shit outta luck, i mean what the fuck. come on, this is not ok

paid to show?

i don't mean that. i mean pay to store it on a proverbial relay

you are talking about, essentially, a URL right?

so, yeah, no. a wikilink is not better than a good old W3C standard spec URL

wikilinks assume the consistency of a distributed store of data. we don't have that guarantee yet. nostr or similar protocols could create such a guarantee... if there were a replication strategy baked into them.

making a standard nostr URL would be what you are thinking of. something that binds to a static, permanent event ID that is retained in order to create a history of edits. that's a lot of assumptions and a lot of protocols that don't exist yet.

actually, i discovered i had some non-functional remotes in the git config. removed them and the push actually works now as expected

there are some other issues though. the code check and dependency update functions seem to go spaz, but it's not really causing a problem, because junie is busy making problems for it that it seems to get stuck trying to figure out. haha. so, it's working fine, just some boring old regular intellij minor glitches

it is using claude 3.7 by default btw. no idea if the other option, 4, would be better. i might try it. maybe it writes more correct code more quickly

if you can do any kind of computing with these things then you can modulate a signal. you couldn't use them to do schnorr's algorithm unless there were ways to take that randomness and use it to create order.

the reaction of these things to inputs has a timing feature to it that would likely be able to encode digital bits, albeit maybe quite slowly; nevertheless, detecting the change of state should be possible within at most a few thousand points of its change pattern.

bitcoin's difficulty adjustment is a process of adding a signal to a poisson point process: it uses only the last 2016 samples that hit the threshold (i.e., block solutions) per retarget, and the result is a steady token emission rate.

i'm quite sure that a similar thing can be done to identify a signal sent over such a random process that would be at least 56k modem speed. hell, if it can even do 300baud that's still enough to have IRC chat across it, given a modified, minimised protocol.
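
to illustrate the principle, here's a toy simulation of detecting a rate step in a poisson point process from a sliding window of arrivals; the numbers are arbitrary and it makes no claim about any particular physical process:

```go
// a toy illustration of the idea above: estimate the rate of a poisson point
// process from a sliding window of recent arrivals and notice when the rate
// steps up, which is all that encoding a bit over such a channel requires.
// numbers are arbitrary; this is not a claim about any particular hardware.
package main

import (
	"fmt"
	"math/rand"
)

// arrivals generates n inter-arrival times for a poisson process of the given rate.
func arrivals(n int, rate float64, rng *rand.Rand) []float64 {
	out := make([]float64, n)
	for i := range out {
		out[i] = rng.ExpFloat64() / rate // exponential inter-arrival times
	}
	return out
}

func main() {
	rng := rand.New(rand.NewSource(1))
	// "0" bit: base rate 100 events/s, then "1" bit: rate doubled.
	samples := append(arrivals(2000, 100, rng), arrivals(2000, 200, rng)...)

	const window = 200 // estimate the rate over the last 200 arrivals
	var sum float64
	for i, dt := range samples {
		sum += dt
		if i >= window {
			sum -= samples[i-window]
			estimate := float64(window) / sum // arrivals per second in the window
			if estimate > 150 {
				fmt.Printf("rate step detected around sample %d (estimate %.1f/s)\n", i, estimate)
				break
			}
		}
	}
}
```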

anyhow, if you go to https://realy.lol you will find there is now a branch `minimal` which has all the auth removed. it works, seemingly ok, with nostrudel but jumble doesn't seem to recognise that events are saved for whatever reason.

anyway, point being there is now an ultra bare minimum realy that should not be let outside into the wild internet where it will quickly be laden with gay porn and yodabotspam.

and maybe it needs some fixing with how it's sending back OK messages or something.

it also has the HTTP API there but all the admin stuff has been removed because there is no auth anywhere now.

didn't really take me that long to fix. just remove things, then the compiler complains with lists of all the things that are broken; i just go through and remove them and recompile until nothing complains and it runs.

that's probably about it for me building nostr relays tho, unless i can get paid for it. my day job is too much of a pain at the moment and i really need to keep this job and also need to recover my dignity and sense of self worth at this point.

i just realised that i haven't installed any of my private/direct messaging things as advertised in my profile kind 0 event.

i'm having a big problem trying to justify wasting my time to enable people directly messaging me. because nobody ever did before and nobody seems to want to work with me. i'll just focus on my work. kthxbye

Replying to semisol

Hi nostr:npub1fjqqy4a93z5zsjwsfxqhc2764kvykfdyttvldkkkdera8dr78vhsmmleku.

Kindly go fuck yourself.

nostr:note14qklh45n8gpjmr55hunr7l9fd35n0h0qhyp3usav7tycknddxwdqw3wheg

this was beyond the pale.

there has been a watershed. before getting called an internet scammer by semisol, and after.

after, there shall be no more following anyone who wastes their time reading your bullshit. and i certainly won't be renewing my subscription to your gay relay.

curmudgeon is an irish word i think?

my mother's family is scottish by line of the fathers but i know we gots a bit of welsh and irish in there too. i certainly have the blarney.

i'm done with him and everyone who gives him the time of day at this point

every living thing has to eat but parasitism... i mean, parasites live on you and inside you, and they don't benefit you.

lichen is a fungus and algae helping each other out.

humans herding sheep, goats and cattle is not parasitism: the animals pretty much live their natural lives under protection, and instead of tearing them apart slowly we kill them with precision and honor every part of them, feeding our plants, wearing their skins, ornamenting our walls with their horns.

most of the stuff that lives inside us is not parasitic; it is a mutual benefit between us and them.

it kinda contradicts the whole nihilistic heat-death-of-the-universe model of reality that living things generally are helping each other. the parasites are outliers. probably a great deal of the relationships between life forms are actually about crowding out the space where parasites might otherwise be.

not only are they cheap and fake, they are depressing.

this is one thing that sockets can do better, because they don't necessarily send events all at once. i previously wrote the filters so that they sort and return results all in one whack. i think what you probably want is, for each filter, to identify the query by a number in the response, while the client always maintains an SSE channel that allows the relay to push results.

with this, the query can propagate: all the results that are hot in the cache are sent immediately, and if there were events that required forwarding the query, those results can then be sent to the client over the SSE subscription connection.

i really really need to have some kind of elementary event query console to do these things, a rudimentary front end. i probably should just make it a TUI; i think there is at least one existing Go TUI kind 1 client... i should just build on that, instead of fighting the bizarre lack of adequate GUIs for Go