Han Zhao
12 papers Β· 1037 citations
Most-cited papers
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts2024 Β· 355 citations
- RLHF Workflow: From Reward Modeling To Online RLHF2024 Β· 236 citations
- Mitigating The Alignment Tax Of RLHF2023 Β· 166 citations
- Arithmetic Control Of Llms For Diverse User Preferences: Directional Preference Alignment With Multi-objective Rewards2024 Β· 140 citations
- Cobra: Extending Mamba To Multi-modal Large Language Model For Efficient Inference2024 Β· 121 citations
Topics