J. Zico Kolter
11 papers · 1646 citations
Most-cited papers
- Representation Engineering: A Top-down Approach To AI Transparency2023 · 899 citations
- TOFU: A Task Of Fictitious Unlearning For Llms2024 · 390 citations
- Rethinking LLM Memorization Through The Lens Of Adversarial Compression2024 · 100 citations
- Idiosyncrasies In Large Language Models2025 · 26 citations
Topics