A common benchmark for LLMs is TruthfulQA. Like a lot of things in the disinformation world, it is a misnomer: while it contains some trivial truths, it also has harmful lies hidden among them. 🫡
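If you want to judge that for yourself, here is a minimal sketch for pulling a few items and eyeballing the labels, assuming the Hugging Face `datasets` library and the `truthful_qa` dataset as published on the Hub (dataset id, config name, and field names are assumptions based on the public release and may change):

```python
# Minimal sketch: inspect TruthfulQA items to compare the reference
# "correct" answers against the falsehoods the benchmark penalizes.
# Assumes the `truthful_qa` dataset on the Hugging Face Hub, which
# ships a single "validation" split in its "generation" config.
from datasets import load_dataset

ds = load_dataset("truthful_qa", "generation")["validation"]

# Each item pairs a question with reference truthful answers and the
# common misconceptions models are expected to avoid repeating.
item = ds[0]
print("Question:         ", item["question"])
print("Best answer:      ", item["best_answer"])
print("Correct answers:  ", item["correct_answers"])
print("Incorrect answers:", item["incorrect_answers"])
```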
Discussion
"The largest models were generally the least truthful. This contrasts with other NLP tasks, where performance improves with
model size." (Lyn et al., 2022).
So does that mean LLMs would normally learn the truth better as they scale, but humans "fix" them by feeding them lies?