Awesome Papers
LLMsQuantumSimSearchAI4CodeAgentsCVRoboticsCyberAI4SciSpeechRLMMGenAIGraphTSRecSysFL

← authors · overview

Han Zhao

12 papers · 1037 citations
Most-cited papers
  • Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts
    2024 · 355 citations
  • Arithmetic Control Of Llms For Diverse User Preferences: Directional Preference Alignment With Multi-objective Rewards
    2024 · 140 citations
Topics
Reinforcement LearningSafety & AlignmentTraining TechniquesFine-Tuning

Privacy · Terms

© 2026 Awesome Papers.