Nostr Web Client

I am lucky enough to have a PC that can run the latest gpt-oss models from OpenAI. I am not impressed. It is like talking to Drax the Destroyer from Guardians of the Galaxy. Even when you explain the joke to it, it still misunderstands.

It also lies. Probably not on purpose; it merely says what a human might say, even when that is impossible for it.

ᴛʜᴇ ᴅᴇᴀᴛʜ ᴏꜰ ᴍʟᴇᴋᴜ 5mo ago

openAI's shit is trash. i've been using the coding variant of mistral lately and it works nice. i don't even bother to waste my time talking to it like it's intelligent, it's just a clever monkey that can remix text, and claude seems to be pretty good at figuring out plans and executing them to write code, tests and find bugs.

i'd much rather run models from hugging face on my GPU than use those shitty cloud services. gonna be very happy when they finally enable local models for junie so i can just use codestral or maybe try some other models that are more focused on programming. the general purpose models provided by cloud providers are, quite frankly, irrelevant and full of bullshit that you don't need for programming work. and i wouldn't trust them to do much writing either, since they are completely infested with commie propaganda screeds.

i literally asked gemini one time to summarise some subject or other, and forgot to give it the link to the thing i wanted it to work on. it started on this screed about renewable energy and shit and i was like, so, this is what it will talk about if you don't give it a specific topic. imagine trying to get this thing to talk about solar forcing of weather and earthquakes. lol. would be fun to read the "thinking" output when it does this and it says "well, this is a thing, but i'm not allowed to say that"

Discussion

Daniel Wigton 5mo ago

I did run these locally. They just can't understand conversation shifts. To be fair, not even grok 3 did that well in my latest test. All I did was ask an easy question, "how many u in the word strawberry?", and follow it up with a joke question: "Isn't there a double u? So should it be 2?"

Any human with half a brain would have realized I was making a stupid joke. I don't really expect AI to catch that, but I expect them to understand what happened after I point it out. gpt-oss just doubles down and makes tables about how you are wrong.

ᴛʜᴇ ᴅᴇᴀᴛʜ ᴏꜰ ᴍʟᴇᴋᴜ 5mo ago

these models are mostly trained by mids, so, they are mid.

i'd love to see what legit intelligent people would do with them.

from the snippets i've seen of grok it's on the high side of mid compared to gpt and gemini. gemini seems to be straight up woke

Daniel Wigton 5mo ago

Grok is by far the best. The only model that understands after you explain.

772f9545... 5mo ago

> The word "strawberry" contains two 'u' characters. Here's how it breaks down:

> stuwburry