
From Word Models to World Models: Training AI for Adversarial Robustness
238
Shift LLMs from next-token to next-state prediction by training in multi-agent, hidden-state environments so their outputs survive adversarial adaptation.

Shift LLMs from next-token to next-state prediction by training in multi-agent, hidden-state environments so their outputs survive adversarial adaptation.