Abstract
Inference-time multi-agent LLM scaling lacks a shared unit: counting nominal agents conflates cost with independent evidence. We derive a two-parameter scaling law where the regime exponent classifies any configuration into one of three asymptotic regimes -- hard-ceiling at (), sublinear at (; only shifts. On free-form math, dense peer influence collapses the answer-level regime from sublinear into hard-ceiling; correctness-level fits remain hard-ceiling throughout. Three findings have practical implications. \emph{(i)}~Thirty dense debating agents produce no more answer diversity than one on MMLU-Hard. \emph{(ii)}~A noise placebo tracks self-correction on free-form math and at scale, so within homogeneous teams the gain commonly attributed to ``debate'' comes from re-evaluation, not peer content. \emph{(iii)}~A single pilot predicts the structural ceiling, and within the configurations tested only architectural diversity (heterogeneous teams) lowers and escapes the hard-ceiling regime, communication-mode interventions do not.