MuJoCo
Emerging
8 papers using it
First seen: 2017
Papers using MuJoCo (8)
- When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL
- WILD-SCAV: Benchmarking FPS Gaming AI On Unity3d-based Environments
- Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control
- OM2P: Offline Multi-Agent Mean-Flow Policy
- The Intentional Unintentional Agent: Learning To Solve Many Continuous Control Tasks Simultaneously
- Language as an Abstraction for Hierarchical Deep Reinforcement Learning
- Cooperative Heterogeneous Deep Reinforcement Learning
- Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs