OpenAI's shit is trash. i've been using the coding variant of mistral lately and it works nice. i don't even bother to waste my time talking to it like it's intelligent, it's just a clever monkey that can remix text. claude seems to be pretty good at figuring out plans and executing them to write code, write tests and find bugs.

i'd much rather run models from hugging face on my GPU than use those shitty cloud services. gonna be very happy when they finally enable local models for junie so i can just use codestral or maybe try some other models that are more focused on programming. the general purpose models provided by cloud providers are, quite frankly, irrelevant and full of bullshit that you don't need for programming work. and i wouldn't trust them to do much writing either, since they are completely infested with commie propaganda screeds.

i literally asked gemini one time to summarise some subject or other, and forgot to give it the link to the thing i wanted it to work on. it started on this screed about renewable energy and shit and i was like, so, this is what it will talk about if you don't give it a specific topic. imagine trying to get this thing to talk about solar forcing of weather and earthquakes. lol. would be fun to read the "thinking" output when it does this and it says "well, this is a thing, but i'm not allowed to say that"

Discussion

I did run these locally. They just can't understand conversation shifts. To be fair, not even Grok 3 did that well in my latest test. All I did was ask an easy question, "how many u in the word strawberry?", and follow it up with a joke question: "Isn't there a double u? So should it be 2?"

Any human with half a brain would have realized I was making a stupid joke. I don't really expect AI to catch that, but I expect them to understand what happened after I point it out. gpt-oss just doubles down and makes tables about how you are wrong.
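
For reference, the literal answer is easy to check. A minimal Python sketch (the helper name `count_letter` is made up for illustration, it isn't from any library):

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    return word.lower().count(letter.lower())


# The literal question: how many 'u' characters are in "strawberry"?
print(count_letter("strawberry", "u"))  # 0

# The joke hinges on 'w' being a "double u"; there is exactly one 'w'.
print(count_letter("strawberry", "w"))  # 1
```

So the straight answer is zero, and the joke only works if you read the single "w" as two u's.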

these models are mostly trained by mids, so, they are mid.

i'd love to see what legit intelligent people would do with them.

from the snippets i've seen of grok it's on the high side of mid compared to gpt and gemini. gemini seems to be straight up woke

Grok is by far the best. The only model that understands after you explain.

> The word "strawberry" contains two 'u' characters. Here's how it breaks down:

>

> stuwburry