← all papers · overview

Empowering VLMs for Few-Shot Multimodal Time Series Classification via Tailored Agentic Reasoning

Abstract

In this paper, we propose the first VLM\underline{\textbf{M}} a\underline{\textbf{a}}gentic r\underline{\textbf{r}}easoning framework for few-s\underline{\textbf{s}}hot multimodal T\underline{\textbf{T}}ime S\underline{\textbf{S}}eries C\underline{\textbf{C}}lassification (MarsTSC\textbf{MarsTSC}), which introduces a self-evolving knowledge bank as a dynamic context iteratively refined via reflective agentic reasoning. The framework comprises three collaborative roles: i) Generator conducts reliable classification via reasoning; ii) Reflector diagnoses the root causes of reasoning errors to yield discriminative insights targeting the temporal features overlooked by Generator; iii) Modifier applies verified updates to the knowledge bank to prevent context collapse. We further introduce a test-time update strategy to enable cautious, continuous knowledge bank refinement to mitigate few-shot bias and distribution shift. Extensive experiments across 12 mainstream time series benchmarks demonstrate that MarsTSC\textbf{MarsTSC} delivers substantial and consistent performance gains across 6 VLM backbones, outperforming both classical and foundation model-based time series baselines under few-shot conditions, while producing interpretable rationales that ground each classification decision in human-readable feature evidence.

Related papers

Ranked by semantic similarity — how closely each paper's abstract matches this one (100% = near-identical topic).