Pretty intuitive next step. Interested to see any implementation detail in the paper
More info on some of the AI projects China is experimenting on.
Tencent + Tsinghua just dropped a paper called Continuous Autoregressive Language Models (CALM) and it basically kills the “next-token” paradigm every LLM is built on.
Instead of predicting one token at a time, CALM predicts continuous vectors that represent multiple tokens at once.
Meaning: the model doesn’t think “word by word”… it thinks in ideas per step.
Here’s why that’s insane
→ 4× fewer prediction steps (each vector = ~4 tokens)
→ 44% less training compute
→ No discrete vocabulary pure continuous reasoning
→ New metric (BrierLM) replaces perplexity entirely
They even built a new energy-based transformer that learns without softmax no token sampling, no vocab ceiling.
It’s like going from speaking Morse code… to streaming full thoughts.
If this scales, every LLM today is obsolete.
Report taken from Robert Yousseff, link to paper
https://arxiv.org/abs/2510.27688

Discussion
No replies yet.