More info on some of the AI projects China is experimenting with.

Tencent + Tsinghua just dropped a paper called Continuous Autoregressive Language Models (CALM) and it basically kills the “next-token” paradigm every LLM is built on.

Instead of predicting one token at a time, CALM predicts continuous vectors that represent multiple tokens at once.

Meaning: the model doesn’t think “word by word”… it thinks in ideas per step.
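To make the step-count arithmetic concrete, here is a minimal sketch that only groups tokens into chunks and counts generation steps. The chunk size of 4 comes from the figures quoted in this note; the helper names `chunk_tokens` and `steps_needed` are made up for illustration, and the mapping from each chunk to a continuous vector is not shown.

```python
import math

def chunk_tokens(tokens, k=4):
    """Group a token sequence into consecutive chunks of up to k tokens."""
    return [tokens[i:i + k] for i in range(0, len(tokens), k)]

def steps_needed(num_tokens, k=1):
    """Autoregressive steps if each step emits k tokens (k=1 is next-token)."""
    return math.ceil(num_tokens / k)

tokens = "the cat sat on the mat and looked out".split()  # 9 toy "tokens"
chunks = chunk_tokens(tokens, k=4)

# In a CALM-style model each chunk would correspond to ONE predicted vector
# (assumption for illustration: whitespace words stand in for real tokens).
print(chunks)
print(steps_needed(len(tokens), k=1), "steps token by token")
print(steps_needed(len(tokens), k=4), "steps chunk by chunk")
```

With 9 tokens and chunks of 4, the loop runs 3 times instead of 9; the last chunk is simply shorter.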

Here’s why that’s insane:

→ 4× fewer prediction steps (each vector = ~4 tokens)

→ 44% less training compute

→ No discrete vocabulary, pure continuous reasoning

→ New metric (BrierLM) replaces perplexity entirely
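This note doesn't say how BrierLM is computed, so the following is only a sketch of the classical Brier score it is named after, not the paper's exact metric. It shows why a Brier-style score suits a model with no softmax: the score can be estimated from samples alone, with no access to probabilities. The toy distribution and function names are assumptions for illustration.

```python
import random

def brier_exact(probs, y):
    """Positively oriented Brier score: 2*P(y) - sum_x P(x)^2 (higher is better).
    Equals 1 minus the classical sum_x (P(x) - 1[x == y])^2."""
    return 2 * probs[y] - sum(p * p for p in probs.values())

def brier_sampled(sample, y, n_pairs=20000, seed=0):
    """Estimate the same score from samples only.
    For independent draws x1, x2: E[1[x1==y] + 1[x2==y] - 1[x1==x2]]
    equals 2*P(y) - sum_x P(x)^2, so no probabilities are needed."""
    rng = random.Random(seed)
    total = 0
    for _ in range(n_pairs):
        x1, x2 = sample(rng), sample(rng)
        total += (x1 == y) + (x2 == y) - (x1 == x2)
    return total / n_pairs

# Toy "model": a fixed distribution over three outcomes (hypothetical).
probs = {"a": 0.5, "b": 0.3, "c": 0.2}
outcomes, weights = list(probs), list(probs.values())

def sample(rng):
    return rng.choices(outcomes, weights=weights)[0]

exact = brier_exact(probs, "a")
estimate = brier_sampled(sample, "a")
print(exact, estimate)
```

The sampled estimate should land close to the exact value; perplexity, by contrast, needs the explicit probability of the true token, which a sampling-only model does not expose.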

They even built a new energy-based transformer that learns without softmax: no token sampling, no vocab ceiling.

It’s like going from speaking Morse code… to streaming full thoughts.

If this scales, every LLM today is obsolete.

Report taken from Robert Yousseff. Link to paper:

https://arxiv.org/abs/2510.27688


Discussion

Pretty intuitive next step. Interested to see the implementation details in the paper.

noyce