Awesome Papers
LLMs
Quantum
SimSearch
AI4Code
Agents
CV
Robotics
Cyber
AI4Sci
Speech
RL
MM
GenAI
Graph
TS
RecSys
FL
☾
☀
← authors
·
overview
Pengyu Cheng
11
papers ·
236
citations
Most-cited papers
Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game
2023 · 35 citations
Topics
Safety & Alignment
Reinforcement Learning
Fine-Tuning
🤖
Ask AI