Awesome Similarity Search
πŸ“„Papers🧭TopicsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ€–Ask AI
β‹―More
πŸ‘₯AuthorsπŸ“šReading PacksπŸ› οΈToolsπŸ“Blogsβœ‰οΈNewsletterπŸ”–Saved
+ Add Paper

← authors Β· overview

Yi Wu

12 papers Β· 737 citations
Most-cited papers
  • Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study
    2024 Β· 273 citations
  • Bitnet: Scaling 1-bit Transformers For Large Language Models
    2023 Β· 220 citations
  • Language Agents With Reinforcement Learning For Strategic Play In The Werewolf Game
    2023 Β· 144 citations
  • Llm-powered Hierarchical Language Agent For Real-time Human-ai Coordination
    2023 Β· 66 citations
  • Real: Efficient RLHF Training Of Large Language Models With Parameter Reallocation
    2024 Β· 29 citations
  • Beyond Ten Turns: Unlocking Long-horizon Agentic Search With Large-scale Asynchronous RL
    2025
  • Focus On The Core: Empowering Diffusion Large Language Models By Self-contrast
    2026
  • Learning Design And Construction With Varying-sized Materials Via Prioritized Memory Resets
    2022
Topics
EfficiencyTraining TechniquesReinforcement LearningAgenticUncategorizedModel ArchitectureSafety & AlignmentPromptingVision-LanguageFine-Tuning

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.