Yilun Zhao
15 papers · 523 citations
Most-cited papers
- Investigating Data Contamination In Modern Benchmarks For Large Language Models2023 · 133 citations
- Benchmarking Generation And Evaluation Capabilities Of Large Language Models For Instruction Controllable Summarization2023 · 88 citations
- Evaluating Llms At Detecting Errors In LLM Responses2024 · 53 citations
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code2023 · 49 citations
- Docmath-eval: Evaluating Math Reasoning Capabilities Of Llms In Understanding Long And Specialized Documents2023 · 43 citations
Topics