D4RL
Canonical
88 papers using it
2,891 HF downloads
4 HF likes
First seen: 2024
D4RL Dataset on HuggingFace
This repository hosts a pre-downloaded copy of the D4RL dataset on HuggingFace. It gives users a faster download path, so they do not need to fetch the dataset from scratch.
Installation
To use this dataset, clone it into your local .d4rl directory. Here are
Hugging Face · apache-2.0 license
Papers using D4RL (88)
- Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
- Directly Forecasting Belief for Reinforcement Learning with Delays
- Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
- Learning on One Mode: Addressing Multi-modality in Offline Reinforcement Learning
- Diffusion Model Predictive Control
- Reinforcement Learning via Value Gradient Flow
- IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning
- Robust Regularized Policy Iteration under Transition Uncertainty
- GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL
- Efficient Anti-exploration via VQVAE and Fuzzy Clustering in Offline Reinforcement Learning
- SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
- Flow Actor-Critic for Offline Reinforcement Learning
- Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning
- Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning
- Agile Reinforcement Learning through Separable Neural Architecture
- Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
- Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism
- Optimal Perturbation Budget Allocation for Data Poisoning in Offline Reinforcement Learning
- Adaptive Replay Buffer for Offline-to-Online Reinforcement Learning
- From Static to Dynamic: Enhancing Offline-to-Online Reinforcement Learning via Energy-Guided Diffusion Stratification
- Opinion: Towards Unified Expressive Policy Optimization for Robust Robot Learning
- Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
- One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow
- Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
- ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts
- Offline Reinforcement Learning with Generative Trajectory Policies
- What Fundamental Structure in Reward Functions Enables Efficient Sparse-Reward Learning?
- Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors
- DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
- Unleashing Flow Policies with Distributional Critics
- Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
- Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
- One-Step Flow Q-Learning: Addressing the Diffusion Policy Bottleneck in Offline Reinforcement Learning
- Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
- Online Pre-Training for Offline-to-Online Reinforcement Learning
- Consistency Trajectory Planning: High-Quality and Efficient Trajectory Optimization for Offline Model-Based Reinforcement Learning
- Should We Ever Prefer Decision Transformer for Offline Reinforcement Learning?
- Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps
- From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning
- Belief-Based Offline Reinforcement Learning for Delay-Robust Policy Optimization
- BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning
- Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
- MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
- CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
- Accelerating Residual Reinforcement Learning with Uncertainty Estimation
- TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning
- Habitizing Diffusion Planning For Efficient And Effective Decision Making
- Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
- Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
- Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
- Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
- Imagination-Limited Q-Learning for Offline Reinforcement Learning
- Diffusion Self-Weighted Guidance for Offline Reinforcement Learning
- Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
- VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
- Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
- Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
- Flow Q-Learning
- Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
- SR-Reward: Taking The Path More Traveled
- PIQL: Projective Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning
- M$^3$PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model
- SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
- Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
- KAN v.s. MLP for Offline Reinforcement Learning
- NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation
- SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
- Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
- Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
- Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens
- Rethinking Optimal Transport in Offline Reinforcement Learning
- Offline Behavior Distillation
- Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
- Hypercube Policy Regularization Framework for Offline Reinforcement Learning
- Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
- Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
- Are Expressive Models Truly Necessary for Offline RL?
- Goal-Conditioned Data Augmentation for Offline Reinforcement Learning
- DRDT3: Diffusion-Refined Decision Test-Time Training Model
- Habitizing Diffusion Planning for Efficient and Effective Decision Making
- Decision SpikeFormer: Spike-Driven Transformer for Decision Making
- Policy-Based Trajectory Clustering in Offline Reinforcement Learning
- Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
- Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication
- Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
- CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning
- Fast and Highly Expressive Policy Learning for Offline Reinforcement Learning via Bootstrapped Flow Q-Learning
- An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning