Qian Liu
14 papers · 1142 citations
Most-cited papers
- Bigcodebench: Benchmarking Code Generation With Diverse Function Calls And Complex Instructions2024 · 471 citations
- Faithful Logical Reasoning Via Symbolic Chain-of-thought2024 · 156 citations
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies2024 · 112 citations
- Improved Few-shot Jailbreaking Can Circumvent Aligned Language Models And Their Defenses2024 · 79 citations
- Swe-dev: Evaluating And Training Autonomous Feature-driven Software Development2025
- Spider2-v: How Far Are Multimodal Agents From Automating Data Science And Engineering Workflows?2024
- Nl2repo-bench: Towards Long-horizon Repository Generation Evaluation Of Coding Agents2025
- Dacomp: Benchmarking Data Agents Across The Full Data Intelligence Lifecycle2025
Topics