Awesome Reinforcement Learning


Dawn Song

14 papers · 8928 citations
Most-cited papers
  • Measuring Massive Multitask Language Understanding
    2020 · 7766 citations
  • Representation Engineering: A Top-down Approach To AI Transparency
    2023 · 899 citations
  • RigorLLM: Resilient Guardrails For Large Language Models Against Undesired Content
    2024 · 76 citations
  • GuardAgent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning
    2024 · 73 citations
  • Decoding Compressed Trust: Scrutinizing The Trustworthiness Of Efficient LLMs Under Compression
    2024 · 54 citations
  • VERINA: Benchmarking Verifiable Code Generation
    2025
  • CUBE: A Standard For Unifying Agent Benchmarks
    2026
  • OpenSage: Self-programming Agent Generation Engine
    2026
  • DevOps-Gym: Benchmarking AI Agents In Software DevOps Cycle
    2026
Topics
  • Safety & Alignment
  • Evaluation
  • Benchmarks
  • Code Agents
  • Model Architecture
  • In-Context Learning
  • Survey Paper
  • Efficiency
  • Training Techniques
  • Agentic
