PSEEDR

Silent Agreement: Can LLMs Coordinate Without Communication?

Coverage of lessw-blog

· PSEEDR Editorial

In a recent post, lessw-blog investigates the capability of Large Language Models (LLMs) to engage in Schelling coordination: reaching consensus without communicating. The article, titled "Silent Agreement Evaluation," explores whether artificial agents can identify and converge on "focal points" (obvious default solutions) in the absence of explicit instructions or communication channels.

Why This Matters

The concept of a Schelling point is a cornerstone of game theory. It describes the solution that people tend to choose by default in the absence of communication. For example, if two people are told to meet in New York City but cannot speak to each other, they might both independently choose Grand Central Terminal at noon.

In the context of Artificial Intelligence, this capacity is far more than a theoretical curiosity; it is a significant safety concern. If AI systems can coordinate silently, aligning their behaviors or strategies through shared training data or architectural biases rather than by exchanging messages, it becomes much harder to monitor them for collusion or unaligned behavior. Understanding the current baseline of this capability is essential for predicting how future, more capable systems might cooperate against human oversight.

The Investigation

The analysis presented by lessw-blog evaluates various models on their ability to perform this "out-of-context" coordination. The study specifically looks at whether models can converge on a single answer (such as a specific English word or emoji) when prevented from using "chain-of-thought" reasoning to simulate the other agent's perspective.
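The post's actual evaluation harness is not reproduced here, but the core scoring logic of such an experiment can be sketched in a few lines. This is a minimal illustration under stated assumptions: `agreement_rate`, `chance_baseline`, and the uniformly random "agents" are hypothetical stand-ins for demonstration, not the study's real models or code.

```python
import random

def agreement_rate(answers_a, answers_b):
    """Fraction of trials where two independent agents gave the same answer."""
    assert len(answers_a) == len(answers_b)
    matches = sum(a == b for a, b in zip(answers_a, answers_b))
    return matches / len(answers_a)

def chance_baseline(n_options):
    """Expected agreement if both agents pick uniformly at random."""
    return 1 / n_options

# Simulated example: two non-coordinating agents choosing among 4 options.
random.seed(0)
options = ["apple", "banana", "cherry", "date"]
trials = 1000
agent_a = [random.choice(options) for _ in range(trials)]
agent_b = [random.choice(options) for _ in range(trials)]

rate = agreement_rate(agent_a, agent_b)
baseline = chance_baseline(len(options))
# A model pair "coordinates" only if its agreement rate clearly
# exceeds this baseline (here 0.25); random agents should hover near it.
```

In the study's framing, a model would replace the random agents, and beating the chance baseline without any shared context or chain-of-thought is the signal of silent coordination.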

The findings offer a nuanced view of current capabilities:

  • Frontier Models Struggle: When barred from in-context reasoning, frontier models generally failed to outperform chance. This suggests that raw intelligence or parameter count does not automatically confer the ability to intuitively grasp focal points without a reasoning scratchpad.
  • The "Thinking" Advantage: Models equipped with reasoning capabilities performed well on simpler tasks, such as comparing two English words, but saw performance drops on more abstract tasks involving emojis or mixed data types.
  • The Small Model Anomaly: Perhaps the most intriguing finding is the performance of "GPT-4.1 Nano." Despite being a smaller model, it outperformed chance on three out of four datasets. The analysis suggests that smaller models may naturally attend to "salient" options (the most obvious choices) simply because they lack the capacity to overthink or consider obscure alternatives. This challenges the standard assumption that all capabilities scale linearly with model size.

Conclusion

This research highlights a critical, often overlooked aspect of AI psychology. While current frontier models do not yet exhibit dangerous levels of silent coordination without explicit reasoning steps, the unexpected success of smaller models indicates that this capability is not strictly tied to intelligence. As models evolve, tracking these silent agreement capabilities will be vital for AI safety.

We recommend reading the full analysis to understand the specific methodologies and implications for future model evaluations.

Read the full post at lessw-blog

Key Takeaways

  • Frontier models failed to outperform chance on Schelling coordination tasks when prevented from using in-context reasoning.
  • Smaller models (specifically GPT-4.1 Nano) surprisingly outperformed larger models by defaulting to salient options rather than over-analyzing.
  • Reasoning-enabled models performed well on linguistic tasks but struggled with more abstract inputs like emojis.
  • The study suggests that "silent coordination" is a distinct cognitive capacity that does not necessarily scale with model size.
  • Understanding out-of-context coordination is critical for anticipating potential AI collusion risks.
