Yilun Zhao
15 papers Β· 523 citations
Most-cited papers
- Investigating Data Contamination In Modern Benchmarks For Large Language Models2023 Β· 133 citations
- Benchmarking Generation And Evaluation Capabilities Of Large Language Models For Instruction Controllable Summarization2023 Β· 88 citations
- Evaluating Llms At Detecting Errors In LLM Responses2024 Β· 53 citations
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code2023 Β· 49 citations
- Docmath-eval: Evaluating Math Reasoning Capabilities Of Llms In Understanding Long And Specialized Documents2023 Β· 43 citations
Topics