Replying to Vitor Pamplona

In 3 years, we will see LLM ASICs on a USB Stick.

This paper eliminates the need for costly matrix multiplication in LLMs, claiming a 10x reduction in memory use during compute. If they can turn a 70B model into a 7B model, we are running these things on phones.

https://arxiv.org/abs/2406.02528

Jim Smij 1y ago

they'll be in everything...

just a matter of time.

#smij #zapd

nostr:nevent1qqsg9c49el0uvn262eq8j3ukqx5jvxzrgcvajcxp23dgru3acfsjqdgppamhxue69uhkummnw3ezumt0d5pzq3svyhng9ld8sv44950j957j9vchdktj7cxumsep9mvvjthc2pjuqvzqqqqqqyv9da4w
