RLHF makes LLMs stupider
Discussion
doesn't even have to be that, i believe there was a paper written on how tuned models are stupider than base model because of the self-censoring that gets super baked into it
RLHF makes LLMs stupider
doesn't even have to be that, i believe there was a paper written on how tuned models are stupider than base model because of the self-censoring that gets super baked into it