hopefully google's "systolic array" memory will start to gain some market share. it's entirely different and built for storing LLMs without the cpu/memory bottleneck
Discussion
The first thing that comes to mind is "how can that be used to further compromise identity?"
I'm just always suspicious.
But that does sound like a cool system if it works out.