cluster #8
50 papers in this cluster (ordered by heat_score)
Papers
- State Representation Learning For Control: An Overview (2018)Timothée Lesort, Natalia Díaz-Rodríguez, Jean-François Goudou, et al.17.39
- Improving Sample Efficiency In Model-free Reinforcement Learning From Images (2019)Denis Yarats, Amy Zhang, Ilya Kostrikov, et al.16.99
- Real-time Model Calibration With Deep Reinforcement Learning (2020)Yuan Tian, Manuel Arias Chao, Chetan Kulkarni, et al.13.74
- Proximal Policy Optimization Via Enhanced Exploration Efficiency (2020)Junwei Zhang, Zhenghao Zhang, Shuai Han, et al.13.70
- Count-based Exploration With The Successor Representation (2018)Marlos C. MacHado, Marc G. Bellemare, Michael Bowling13.17
- Return-based Contrastive Representation Learning For Reinforcement Learning (2021)Guoqing Liu, Chuheng Zhang, Li Zhao, et al.12.17
- Robust Model-free Reinforcement Learning With Multi-objective Bayesian Optimization (2019)Matteo Turchetta, Andreas Krause, Sebastian Trimpe11.08
- Barc: Backward Reachability Curriculum For Robotic Reinforcement Learning (2018)Boris Ivanovic, James Harrison, Apoorva Sharma, et al.10.74
- The Utility Of Sparse Representations For Control In Reinforcement Learning (2018)Vincent Liu, Raksha Kumaraswamy, Lei Le, et al.10.48
- Deep Reinforcement Learning For Stabilization Of Large-scale Probabilistic Boolean Networks (2022)Sotiris Moschoyiannis, Evangelos Chatzaroulas, Vytenis Sliogeris, et al.10.07
- A Survey On Physics Informed Reinforcement Learning: Review And Open Problems (2023)Chayan Banerjee, Kien Nguyen, Clinton Fookes, et al.9.76
- Exploration Versus Exploitation In Reinforcement Learning: A Stochastic Control Approach (2018)Haoran Wang, Thaleia Zariphopoulou, Xunyu Zhou9.76
- Sample-efficient Model-free Reinforcement Learning With Off-policy Critics (2019)Denis Steckelmacher, Hélène Plisnier, Diederik M. Roijers, et al.9.60
- Generalized Population-based Training For Hyperparameter Optimization In Reinforcement Learning (2024)Hui Bai, Ran Cheng9.59
- Unsupervised Representation Learning In Deep Reinforcement Learning: A Review (2022)Nicolò Botteghi, Mannes Poel, Christoph Brune9.59
- Adaptive Dynamic Programming For Model-free Tracking Of Trajectories With Time-varying Parameters (2019)Florian Köpf, Simon Ramsteiner, Michael Flad, et al.9.59
- Drl4route: A Deep Reinforcement Learning Framework For Pick-up And Delivery Route Prediction (2023)Xiaowei Mao, Haomin Wen, Hengrui Zhang, et al.9.41
- Learning Sparse Representations In Reinforcement Learning With Sparse Coding (2017)Lei Le, Raksha Kumaraswamy, Martha White8.82
- Starformer: Transformer With State-action-reward Representations For Visual Reinforcement Learning (2021)Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, et al.8.82
- Learning To Identify Critical States For Reinforcement Learning From Videos (2023)Haozhe Liu, Mingchen Zhuge, Bing Li, et al.8.76
- Self-organization Of Action Hierarchy And Compositionality By Reinforcement Learning With Recurrent Neural Networks (2019)Dongqi Han, Kenji Doya, Jun Tani8.60
- Inexact Iterative Numerical Linear Algebra For Neural Network-based Spectral Estimation And Rare-event Prediction (2023)John Strahan, Spencer C. Guo, Chatipat Lorpaiboon, et al.8.35
- Addressing Action Oscillations Through Learning Policy Inertia (2021)Chen Chen, Hongyao Tang, Jianye Hao, et al.7.81
- Skill-critic: Refining Learned Skills For Hierarchical Reinforcement Learning (2023)Ce Hao, Catherine Weaver, Chen Tang, et al.7.50
- Alternating Optimisation And Quadrature For Robust Control (2016)Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, et al.7.16
- Learning Optimal Integration Of Spatial And Temporal Information In Noisy Chemotaxis (2023)Albert Alonso, Julius B. Kirkegaard6.77
- The Value-improvement Path: Towards Better Representations For Reinforcement Learning (2020)Will Dabney, André Barreto, Mark Rowland, et al.6.77
- Gradient-aware Model-based Policy Search (2019)Pierluca D'Oro, Alberto Maria Metelli, Andrea Tirinzoni, et al.6.77
- Uncertainty Quantification And Exploration For Reinforcement Learning (2019)Yi Zhu, Jing Dong, Henry Lam6.77
- Containergym: A Real-world Reinforcement Learning Benchmark For Resource Allocation (2023)Abhijeet Pendyala, Justin Dettmer, Tobias Glasmachers, et al.6.34
- Exploiting The Sign Of The Advantage Function To Learn Deterministic Policies In Continuous Domains (2019)Matthieu Zimmer, Paul Weng6.34
- Towards Model-based Reinforcement Learning For Industry-near Environments (2019)Per-Arne Andersen, Morten Goodwin, Ole-Christoffer Granmo5.84
- Continuous Value Iteration (CVI) Reinforcement Learning And Imaginary Experience Replay (IER) For Learning Multi-goal, Continuous Action And State Space Controllers (2019)Andreas Gerken, Michael Spranger5.84
- Deterministic Sequencing Of Exploration And Exploitation For Reinforcement Learning (2022)Piyush Gupta, Vaibhav Srivastava5.84
- Understanding What Affects The Generalization Gap In Visual Reinforcement Learning: Theory And Empirical Evidence (2024)Jiafei Lyu, Le Wan, Xiu Li, et al.5.84
- Control-oriented Model-based Reinforcement Learning With Implicit Differentiation (2021)Evgenii Nikishin, Romina Abachi, Rishabh Agarwal, et al.5.84
- Policy Optimization With Model-based Explorations (2018)Feiyang Pan, Qingpeng Cai, An-Xiang Zeng, et al.5.84
- Deep Reinforcement Learning With Feedback-based Exploration (2019)Jan Scholten, Daan Wout, Carlos Celemin, et al.5.84
- Gamma-nets: Generalizing Value Estimation Over Timescale (2019)Craig Sherstan, Shibhansh Dohare, James MacGlashan, et al.5.84
- Clipup: A Simple And Powerful Optimizer For Distribution-based Policy Evolution (2020)Nihat Engin Toklu, Paweł Liskowski, Rupesh Kumar Srivastava5.84
- Assumed Density Filtering Q-learning (2017)Heejin Jeong, Clark Zhang, George J. Pappas, et al.5.24
- Zeroth-order Actor-critic: An Evolutionary Framework For Sequential Decision Problems (2022)Yuheng Lei, Yao Lyu, Guojian Zhan, et al.5.24
- Colored Noise In PPO: Improved Exploration And Performance Through Correlated Action Sampling (2023)Jakob Hollenstein, Georg Martius, Justus Piater4.52
- Periodic Intra-ensemble Knowledge Distillation For Reinforcement Learning (2020)Zhang-Wei Hong, Prabhat Nagarajan, Guilherme Maeda4.52
- The Impact Of Task Underspecification In Evaluating Deep Reinforcement Learning (2022)Vindula Jayawardana, Catherine Tang, Sirui Li, et al.4.52
- ACERAC: Efficient Reinforcement Learning In Fine Time Discretization (2021)Jakub Łyskawa, Paweł Wawrzyński4.52
- STORM: Efficient Stochastic Transformer Based World Models For Reinforcement Learning (2023)Weipu Zhang, Gang Wang, Jian Sun, et al.4.52
- Measuring The Reliability Of Reinforcement Learning Algorithms (2019)Stephanie C. Y. Chan, Samuel Fishman, John Canny, et al.4.43
- Distop: Discovering A Topological Representation To Learn Diverse And Rewarding Skills (2021)Arthur Aubret, Laetitia Matignon, Salima Hassas3.58
- Trade-off On Sim2real Learning: Real-world Learning Faster Than Simulations (2020)Jingyi Huang, Yizheng Zhang, Fabio Giardina, et al.3.58