Awesome Similarity Search
πŸ“„Papers🧭TopicsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸŽ“LearnπŸ€–Ask AI
β‹―More
πŸ‘₯AuthorsπŸ“šReading PacksπŸ› οΈToolsπŸ“Blogsβœ‰οΈNewsletterπŸ”–Saved
+ Add Paper

← authors Β· overview

Han Zhao

12 papers Β· 1037 citations
Most-cited papers
  • Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts
    2024 Β· 355 citations
  • RLHF Workflow: From Reward Modeling To Online RLHF
    2024 Β· 236 citations
  • Mitigating The Alignment Tax Of RLHF
    2023 Β· 166 citations
  • Arithmetic Control Of Llms For Diverse User Preferences: Directional Preference Alignment With Multi-objective Rewards
    2024 Β· 140 citations
  • Cobra: Extending Mamba To Multi-modal Large Language Model For Efficient Inference
    2024 Β· 121 citations
Topics
Reinforcement LearningTraining TechniquesSafety & AlignmentFine-TuningEfficiencyModel ArchitectureVision-Language

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.