Han Zhao
12 papers · 1037 citations
Most-cited papers
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts2024 · 355 citations
- Arithmetic Control Of Llms For Diverse User Preferences: Directional Preference Alignment With Multi-objective Rewards2024 · 140 citations
Topics