QwQ 32B was published today and I already tested it for AHA Leaderboard. The results are not that good! It did better than its predecessor (Qwen 2.5) in fasting and nutrition but worse in domains like nostr, bitcoin and faith. Overall worse than previous.

LLMs are getting detached from humans. Y'all have been warned, lol.

Reply to this note

Please Login to reply.

Discussion

The worst part is that llm's are telling us lies in a nice way. The youngs are consuming llm outputs as facts and this will ruin manybthings in the future.