K^2-agent: Co-evolving Know-what And Know-how For Hierarchical Mobile Device Control
2026 Β· Zhe Wu, Donglin Mo, Hongjin Lu, et al.
Abstract
Existing mobile device control agents often perform poorly when solving complex tasks requiring long-horizon planning and precise operations, typically due to a lack of relevant task experience or unfamiliarity with skill execution. We propose K2-Agent, a hierarchical framework that models human-like cognition by separating and co-evolving declarative (knowing what) and procedural (knowing how) knowledge for planning and execution. K2-Agent's high level reasoner is bootstrapped from a single demonstration per task and runs a Summarize-Reflect-Locate-Revise (SRLR) loop to distill and iteratively refine task-level declarative knowledge through self-evolution. The low-level executor is trained with our curriculum-guided Group Relative Policy Optimization (C-GRPO), which (i) constructs a balanced sample pool using decoupled reward signals and (ii) employs dynamic demonstration injection to guide the model in autonomously generating successful trajectories for training. On the challenging A
Authors
(none)
Tags
Stats
Related papers
- Mobile-r1: Towards Interactive Capability For Vlm-based Mobile Agent Via Systematic Training (2026)0.00
- I Know How: Combining Prior Policies To Solve New Tasks (2024)2.26
- Multi-agent Common Knowledge Reinforcement Learning (2018)0.00
- Cautiously-optimistic Knowledge Sharing For Cooperative Multi-agent Reinforcement Learning (2023)5.84
- Knowledge Transfer In Multi-task Deep Reinforcement Learning For Continuous Control (2020)0.00
- KD-MARL: Resource-aware Knowledge Distillation In Multi-agent Reinforcement Learning (2026)0.00
- Knowru: Knowledge Reusing Via Knowledge Distillation In Multi-agent Reinforcement Learning (2021)9.23
- Hierarchical Cooperative Multi-agent Reinforcement Learning With Skill Discovery (2019)5.24