Barbara Plank
19 papers · 472 citations
Most-cited papers
- LLMs Instead Of Human Judges? A Large Scale Empirical Study Across 20 NLP Evaluation Tasks · 2024 · 236 citations
- Beyond Accuracy: Evaluating The Reasoning Behavior Of Large Language Models -- A Survey · 2024 · 98 citations
- FinerCut: Finer-grained Interpretable Layer Pruning For Large Language Models · 2024 · 22 citations
- The Potential And Challenges Of Evaluating Attitudes, Opinions, And Values In Large Language Models · 2024 · 22 citations
- Comparing Inferential Strategies Of Humans And Large Language Models In Deductive Reasoning · 2024 · 19 citations
Topics