Massive Multitask Language Understanding (MMLU)
Emerging6papers using it
2024first seen
Papers using Massive Multitask Language Understanding (MMLU) (6)
- Confidence-Driven Multi-Scale Model Selection for Cost-Efficient InferenceEnterprise Large Language Model Evaluation BenchmarkLanguage Complexity Measurement as a Noisy Zero-Shot Proxy for
Evaluating LLM PerformanceLAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and GiriamaDFPE: A Diverse Fingerprint Ensemble for Enhancing LLM PerformanceRaCT: Ranking-aware Chain-of-Thought Optimization for LLMs