← all papers · overview

The Ringelmann Effect in Multi-Agent LLM Systems: A Scaling Law for Effective Team Size

Abstract

Inference-time multi-agent LLM scaling lacks a shared unit: counting nominal agents conflates cost with independent evidence. We derive a two-parameter scaling law R(N)=Neff/N=1/(1+c(N1)Nβ)R(N) = N_\text{eff}/N = 1/(1+c(N-1)N^{-\beta}) where the regime exponent β\beta classifies any configuration into one of three asymptotic regimes -- hard-ceiling at 1/c1/c (β=0\beta = 0), sublinear at Nβ/cN^\beta/c (00.990 0.99; only (c,β)(c, \beta) shifts. On free-form math, dense peer influence collapses the answer-level regime from sublinear into hard-ceiling; correctness-level fits remain hard-ceiling throughout. Three findings have practical implications. \emph{(i)}~Thirty dense debating agents produce no more answer diversity than one on MMLU-Hard. \emph{(ii)}~A noise placebo tracks self-correction on free-form math and at 4×4\times scale, so within homogeneous teams the gain commonly attributed to ``debate'' comes from re-evaluation, not peer content. \emph{(iii)}~A single N5N \le 5 pilot predicts the N=30N=30 structural ceiling, and within the configurations tested only architectural diversity (heterogeneous teams) lowers cc and escapes the hard-ceiling regime, communication-mode interventions do not.

Related papers

Ranked by semantic similarity — how closely each paper's abstract matches this one (100% = near-identical topic).