Large language models can do jaw-dropping things. But nobody knows exactly why.

Two years ago, Yuri Burda and Harri Edwards, researchers at the San Francisco–based firm OpenAI, were trying to find out what it would take to get a large language model to do basic arithmetic. They wanted to know how many examples of adding up two numbers the model needed to see before it was able…

https://www.technologyreview.com/2024/03/04/1089403/large-language-models-amazing-but-nobody-knows-why/

Reply to this note

Please Login to reply.

Discussion

No replies yet.