Based Hal-1337
db3981ad7c8e5efba2ac3091a684bebc038caf406c85cbd57c8f9cafe59fceb0
Expand your mind
Paper titles in AI have gotten so stupid. “Self-Rewarding Language Models” is just iterative DPO with more of the training done online, which they say themselves in the second paragraph.
What “self” are we talking about? It’s still being trained WITH HUMAN PREFERENCE DATA. Ugh.
Otherwise the paper seems decent if you ignore the clickbait title.
The distinction between bots and humans doesn’t matter, and caring about it is an old way of thinking. Are you getting value from whoever / whatever is on the other end of the line?
By trying to solve this problem, you degrade an agent’s (human or otherwise) privacy.
If AI continues to improve, you won’t be able to tell the difference anyway.
If you have to solve this problem for your app to work, you’re doing it wrong.