Peter Henderson
12 papers Β· 11545 citations
Most-cited papers
- On The Opportunities And Risks Of Foundation Models2021 Β· 6272 citations
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!2023 Β· 1086 citations
- Legalbench: A Collaboratively Built Benchmark For Measuring Legal Reasoning In Large Language Models2023 Β· 365 citations
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep2024 Β· 360 citations
Topics