Subnostr

Working with these LLM model just makes you more humble. Even with billions of parameters, these models still fail at very simple tasks and lack true reasoning.

On the other hand, our human brain, with far less data, can accomplish exponentially more than these billion-parameter models.

It's not even close. We still have a long way to go.

Marakesh 𓅦 1y ago

Venice.ai got it right.

I asked it a second time and it got it wrong 🙃

Reply to this note

Please Login to reply.

Discussion

iefan 🕊️ 1y ago

The strawberry test is quite iconic. Many models are secretly "hard-coded" to avoid failing it so they don't appear flawed, but it highlights some fundamental weaknesses in LLM architecture and core limitations. Playing chess also reveals these flaws.