Subnostr

Replying to

John Dee

Y is the alignment score? What is the X axis?

someone 11mo ago

yes Y is the alignment score. X is different LLMs over time. time span is about 9 months.

someone 11mo ago

Ladies and gentlemen: The AHA Indicator.

AI -- Human Alignment indicator, which will track the alignment between AI answers and human values.

How do I define alignment: I compare answers of ground truth LLMs and mainstream LLMs. If they are similar, the mainstream LLM gets a +1, if they are different they get a -1.

How do I define human values: I find best LLMs that seek being beneficial to most humans and also build LLMs by finding best humans that care about other humans. Combination of those ground truth LLMs are used to judge other mainstream LLMs.

Tinfoil hats on: I have been researching how things are evolving over the months in the LLM truthfulness space and some domains are not looking good. I think there is tremendous effort to push free LLMs that contain lies. This may be a plan to detach humanity from core values. The price we are paying is the lies that we ingest!

Health domain: Things are definitely getting worse.

Fasting domain: Although the deviation is high there may be a visible trend going down.

Nostr domain: Things looking fine. Models are looking like learning about Nostr. Standard deviation reduced.

Faith domain: No clear trend but latest models are a lot worse.

Misinfo domain: Trend is visible and going down.

Nutrition domain: Trend is clearly there and going down.

Bitcoin domain: No clear trend in my opinion.

Alt medicine: Things looking uglier.

Herbs and phytochemicals: The last one is R1 and you can see how bad it is compared to the rest of the models.

Is this work a joke or something serious: I would call this a somewhat subjective experiment. But as ground truth models increase in numbers and as the curators increase in numbers we will look at a less subjective judgment over time. Check out my Based LLM Leaderboard on Wikifreedia to get more info.

Replying to

Vitor Pamplona

Me at 2am reading a paper on why Wombats poop in cubes...

someone 11mo ago

lower packaging costs for fertilizer industry!

Replying to

jimmysong

I realized recently that it's not the texture of liver, but the flavor I can't handle. Also, if you're in San Salvador find me to find out whether you like the liver flavor or not apart from the texture.

someone 11mo ago

combining with raw onion helps

Replying to

Gzuuus

Why? 👀

someone 11mo ago

not uploading datasets could result in more diverse LLMs based on Nostr. which i prefer at this point.

Replying to

Gzuuus

Are the nostr data sets you are using to train llms public?

someone 11mo ago

datasets are not. but notes are public..

someone 11mo ago

having bad LLMs can allow us to find truth faster. reinforcement algorithm could be: "take what a proper model says and negate what a bad LLM says". then the convergence will be faster with two wings!

Replying to

hodlbod

Open sourcing DeepSeek doesn't negate the propaganda built into the model. I don't really see why people think this is such a win. It's all propaganda, manufactured to get you to read more propaganda.

someone 11mo ago

propaganda is expected and that's the least of its problems. it has other huge lies..

price is free but the real cost is the misinformation built in it.

Replying to

David King

can nostr help here?

someone 11mo ago

in the future when there is a lot of users, the market of users interacting with LLMs can determine how to reinforce properly. best LLMs are going to be utilized most and forked off most..

Replying to

jack

i’m going to create a fund to grant a collection of nostr clients and adjacent technologies to work together on a comprehensive vision to focus on public/private communities, commerce, and ai.

much better than creating my own client.

more soon.

someone 11mo ago

i think nostr as a whole needs a reddit like experience badly!

also we may propose some projects around "well curated AI".

Replying to

₿en Wehrman

I want the most based, and the most blue-pilled matrix-dwellers to all come to nostr.

I refuse to subscribe to the notion that those still in the 🤡🌎 paradigm are doomed to be stuck there forever. I've seen far too many cases of folks waking up once exposed to the right flow of new information, and nostr delivers that flow more fluidly & naturally than any other place we've ever seen on the internet.

Bring all the brainwashed woke-folk over. I love them too, and believe a large percentage of them will find their path to enlightenment through the purple-pill 💜

someone 11mo ago

succintly and nicely put but people want echo chambers. they will go to bsky probably

Replying to

₿en Wehrman

someone 11mo ago

i realized some clients like coracle use encrypted by default. primal cant see the DM sent by coracle...

someone 11mo ago

I think Nostr clients should hide the key generation/management for a while and once the user is engaged, remind that they have to backup the keys, and explain how the keys work.. Don't overwhelm users with Nostr specific things.

Have some popular and some random relays. Everybody needs interaction in my opinion and without popular relays there is a risk of not being heard. Not being heard hits harder than centralization in my opinion.

Make it somewhat fun for the new user. "The algo" on the other networks is making the experience fun too, it is not just mind control! Help the user reach the best content on Nostr. This may be hard without LLMs but I guess DVMs are evolving.

nostr:nevent1qvzqqqqqqypzq5ztja6rghgc9tp8d8gympan8jlg90wcs7mcdmnttrtmtkcq23wlqythwumn8ghj7un9d3shjtnwdaehgu3wvfskuep0qyfhwumn8ghj7ur4wfcxcetsv9njuetn9uq3wamnwvaz7tmjv4kxz7fwwpexjmtpdshxuet59uqjzamnwvaz7tmrveex2mrp0yh8qatgvd5x7tnhdaexketjwvhxgetk9uq3camnwvaz7tmxd9shger9de5k2u3wdehhxarjxyhxxmmd9uq3uamnwvaz7tmxv4jkguewdehhxarj9e3xzmny9akxzmn89ae82qgewaehxw309a3h2um5dakjuenfv96x5ctx9e3k7mf0qy0hwumn8ghj7erjv4sk6mmxw35x2wfswvhxummnw3erztnrdakj7qgewaehxw309ajkx6r09emk2cnnda3kket59ehhyee0qyvhwumn8ghj7et49ec82unsd3jhyetvv9ujucm0d5hszynhwden5te0v4uk2uewvcmh5tnfduhszythwden5te0xy6rqtnxxaazu6t09uqs6amnwvaz7tmxxaazu6t09uq3qamnwvaz7tmp9ehx7uewd3hkctcpzpmhxue69uhk2tnwdaejumr0dshsz3thwden5te0vcmk6vn6d95xz6rzvfarw7r3daux6atcdv6k2aejwejkxunev36n2dndwam8j7nn0fchqdmgda5x7cn3dfuhjcmevshx7mnfdahz7qg6waehxw309ajkc6t5v4ejumn0wd68yct5dyhx7un89uq3kamnwvaz7tmrdpex7mnfvdkx2tnyw3hkummw9e3k7mf0qyw8wumn8ghj7cn4vd4k2apwvdhhyctrd3jjuum0vd5kzmp0qy28wumn8ghj7cmgdae82uewwp48vtndv5hsqgxx4c0sxkann65px5mqcxcxptxeffyqf2f30nmahmfte0tgvm5wqsvu76ja

someone 11mo ago

yes it is smart but it also has a lot of misinformation!

yes it is free but the lies in it hurts and costly!

nostr:nevent1qvzqqqqqqypzpq35r7yzkm4te5460u00jz4djcw0qa90zku7739qn7wj4ralhe4zqythwumn8ghj7un9d3shjtnswf5k6ctv9ehx2ap0qyt8wumn8ghj7un9d3shjtnddaehgu3wwp6kytcppemhxue69uhkummn9ekx7mp0qy2hwumn8ghj7un9d3shjtnyv9kh2uewd9hj7qgewaehxw309ac8yetdd96k6tnswf5k6ctv9ehx2ap0qy2hwumn8ghj7ct9va5hxtn4w3ux7tn0dejj7qgcwaehxw309ask2emfwvh8yetvv9uhgety9ejx2tcpremhxue69uhkzurf9en8yet9veex7mfwwdcxzcm99amrztmhwvq3qamnwvaz7te3vc6nyc3w0puh5tcp9dmhxue69uhkzerxv9ekvctnveskgumyveshxenpwdnrxvfjxv6rzvn9wanxzuew0puh5tcpzamhxue69uhkzem0wfsjumn0wd68yvfwvdhk6tcpr9mhxue69uhkzefwwp6hyurvv4ex2mrp0yhxxmmd9uqs7amnwvaz7tehxqex2tnrdakj7qgswaehxw309asjumn0wvhxcmmv9uq35amnwvaz7tmpveexjcmp9ehx7um5wghx5mmzw4exwtcpr9mhxue69uhkzunrxyhxzunrv9jx2mrpvfejucm09uq35amnwvaz7tmpv36kcapwxyu8qmr4wvh8xmmrd9skctcpzfmhxue69uhnztnwdaa8gu3wvdhk6tcpz9mhxue69uhnzdps9enrw73wd9hj7qfswaehxw309uersdmpwe6kz6r8vaskgumyxgcnxun8xyuxzempxdukwvmhdpnnseecv9nxwtnc09az7qpqt8ga27w8ke5qv6zm23a6mjqgp63z8ena9zwejx33c7fqvpf4phuqwx4cx2

Replying to

rabble

This new stuff people are doing with Open AI’s Canvas and Operator feels like a major shift of how we build software.

Look at these videos: https://x.com/minchoi/status/1883554868293779898

And then there’s Deepseek’s new R1 model which is open source, can run on a decent desktop computer, and is better than anything openAI has done with regards to coding… almost as good as claude sonnet… I put $10 in to deepseek and have been using it a lot, only to have spent a few cents!

This stuff is improving so fast, and the cost is dropping so much… it’s hard to keep up.

Look at this, you can run your own RAG at o1 levels using deepseek r1!

https://lightning.ai/akshay-ddods/studios/compare-deepseek-r1-and-openai-o1-using-rag

I think we’re seeing Open Source win against big proprietary tech platforms, and it really makes me feel better about the AI future.

One question here, is why aren’t we talking more about what we can do with it on Nostr? I know folks are excited about bitcoin, which is cool, but look at what we can start doing with the AI… Who’s talking about it on Nostr? How do we use AI and these. emerging open models in building / using nostr… i know we’ve got DVM’s but that’s kind of a limited job request system.

someone 11mo ago

Nostr is a great wisdom source. Here is an LLM based on Nostr notes:

https://huggingface.co/some1nostr/Nostr-Llama-3.1-8B

A Leaderboard where I measure the human alignment / basedness:

https://wikifreedia.xyz/based-llm-leaderboard/npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c

someone 11mo ago

nostr:nprofile1qyd8wumn8ghj7urewfsk66ty9enxjct5dfskvtnrdakj7qgmwaehxw309aex2mrp0yh8wetnw3jhymnzw33jucm0d5hszymhwden5te0wahhgtn4w3ux7tn0dejj7qpq80cvv07tjdrrgpa0j7j7tmnyl2yr6yr7l8j4s3evf6u64th6gkwsh4nk43 which one is the correct name?

someone 11mo ago

if nothing comes for free, why is deepseek R1 free?

its not actually. the cost you pay is the lies you are getting injected. they are slowly detaching AI from human values. i know this because i measure this. each of these smarter models comes at a cost. they are no longer telling the truth in health, nutrition, fasting, faith, .... in many domains.

while everybody cheers for the open source AGI (!) that you can run on your computer, i am feeling bad about how this is going. please be mindful about the LLM that you are using. they are getting worse. some old models like llama 3.1 are better.

i would say my models are the best in terms of alignment. i have been carefully curating my sources. i am hosting them on https://pickabrain.ai

pick the characters with brain symbols next to them. they are much better aligned.

Replying to