Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization
2024 Β· Wenqi Zhang, Ke Tang, Hai Wu, et al.
Abstract
Large Language Models (LLMs) exhibit robust problem-solving capabilities for diverse tasks. However, most LLM-based agents are designed as specific task solvers with sophisticated prompt engineering, rather than agents capable of learning and evolving through interactions. These task solvers necessitate manually crafted prompts to inform task rules and regulate LLM behaviors, inherently incapacitating to address complex dynamic scenarios e.g., large interactive games. In light of this, we propose Agent-Pro: an LLM-based Agent with Policy-level Reflection and Optimization that can learn a wealth of expertise from interactive experiences and progressively elevate its behavioral policy. Specifically, it involves a dynamic belief generation and reflection process for policy evolution. Rather than action-level reflection, Agent-Pro iteratively reflects on past trajectories and beliefs, fine-tuning its irrational beliefs for a better policy. Moreover, a depth-first search is employed for pol
Authors
(none)
Tags
Stats
Related papers
- Proagent: Building Proactive Cooperative Agents With Large Language Models (2023)12.74
- Agentevolver: Towards Efficient Self-evolving Agent System (2025)0.00
- Policyevolve: Evolving Programmatic Policies By Llms For Multi-player Games Via Population-based Training (2025)0.00
- End-to-end Optimization Of Llm-driven Multi-agent Search Systems Via Heterogeneous-group-based Reinforcement Learning (2025)0.00
- Discovering Multiagent Learning Algorithms With Large Language Models (2026)2.05
- Llm-explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven By Large Language Models (2025)0.00
- Tompo: Training LLM Strategic Decision Making From A Multi-agent Perspective (2025)0.00
- Towards Agentic Self-learning Llms In Search Environment (2025)0.00