I really hope this turns out to be true. I’m opposed to the idea that ā€œscale is all you needā€, rather, I believe that ā€œinnovation / research are all you need.ā€

The concern I have is that the scaling strategy can still be applied to multi-pass models, which would likely outperform smaller ones. This not only increases training costs, but also makes inference more expensive due to the need for multiple actions.

That said, I’m not very familiar with these types of architectures, so I’d be happy to read any material you’d recommend.

Reply to this note

Please Login to reply.

Discussion

Relays have been behaving weirdly for a few days now.

I use two main desktop clients nostr:nprofile1qythwumn8ghj7mn0wd68ytnsv9ex2ar09e6x7amwqyv8wumn8ghj7urjv4kkjatd9ec8y6tdv9kzumn9wsqzq5edsvxllcyuz0n4azc5tjp9wx8uz2cqq0mp6c0fqamjr3llly7tksuz3y and nostr:nprofile1qy88wumn8ghj7mn0wvhxcmmv9uq3qamnwvaz7tmwdaehgu3wd4hk6tcqyr6whrnz4hgngzuu4hxesc0xdxewjp7w556wpaln4jt5cyw8tzj35qj25jp I see some posts on one platform and some posts on another.

This note appears on Primal, but I cant reply, repost or like and it isn't showing up at all on Jumble (or iPhone nostr:nprofile1qy2hwumn8ghj7etyv4hzumn0wd68ytnvv9hxgqgdwaehxw309ahx7uewd3hkcqpq8m76awca3y37hkvuneavuw6pjj4525fw90necxmadrvjg0sdy6qsmthtls )

So to @npub1r6cdfl0z2zeg5nc0txttmfxxxw98k6quyckgc4zqh5zhd49hnwrs75gflm

Apologies, the reply, I've been trying to post is:

"This is all new to me. I’m learning as I go along trying to build Brian, my replacement brain."

nostr:nevent1qvzqqqqqqypzq84s6n77y59j3f8s7kvkhkjvvvu20d5pcf3v332yp0g9wm2t0xu8qqs9yaddg5xsw82a6j4x9yp0x2d55ds27pcafz6h96h6gd7933puwzgj3vuxk