Poor Man's Interaction Models

by Overthinking Machines Lab

Training Laguna XS.2 to (1) preemptively model two-side dialogue, and (2) have a sense of time by using special tokens for micro-pauses: we get pseudo-duplex interaction, including turn modelling, in a text-only model.

Models: Laguna XS.2 (Non-interactive), with the interaction model system prompt/harness but no training (Interactive), and with training (Interactive RL).