Awesome Programming Languages
Programming Languages is one of the most active areas in Awesome AI Agents — 39 papers in this collection, evaluated on datasets like API Design, BFCLv-3, BFCL v-4. A strong starting point is "Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification".
Datasets & benchmarks
Key papers
- Local Success Does Not Compose: Benchmarking Large Language Models for Compositional Formal Verification (2025)Xu Xu et al.8.37
- Towards Repository-Level Program Verification with Large Language Models (2025)Si Cheng Zhong et al.4.69
- AI-PROPELLER: Warehouse-Scale Interprocedural Code Layout Optimization with AlphaEvolve (2026)Chaitanya Mamatha Ananda et al.4.39
- SEMBridge: Tagless-Final Program Semantics with Weakest-Precondition and Bounded-Checking Interpretations (2026)Eric Liang4.39
- SNN-MLIR: An MLIR Dialect for Compiling Neuromorphic SNNs from NIR to Bare-Metal C (2026)Alejandro Garc\'ia Gener et al.4.39
- VERITAS: Verifier-Guided Proof Search for Zero-Shot Formal Theorem Proving (2026)Manish Acharya et al.4.39
- Stdlib or Third-Party? Empirical Performance and Correctness of LLM-Assisted Zero-Dependency Python Libraries (2026)Peng Ding et al.4.33
- Agentic Harness Engineering: Observability-Driven Automatic Evolution of Coding-Agent Harnesses (2026)Jiahang Lin et al.4.26
- Capability-driven Skill Generation With Llms: A Rag-based Approach For Reusing Existing Libraries And Interfaces (2025)Luis Miguel Vieira da Silva, Aljosha Köcher, Nicolas König, et al.3.86
- PromptMN: Pseudo Prompting Language (2026)Enkhzol Dovdon3.51
- SimpleTool: Parallel Decoding for Real-Time LLM Function Calling (2026)Xiao Shi et al.3.32
- Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs (2026)Yunhao Chen et al.2.00
- LEGO: An LLM Skill-Based Front-End Design Generation Platform (2026)Jincheng Lou et al.2.00
- PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control (2026)Jingxuan Wei et al.2.00
- The IsalProgram Programming Language (2026)Ezequiel L\'opez-Rubio2.00
- Towards Human-Level Book-Writing Capability (2026)Jan Zierstek et al.2.00
- Skvm: Revisiting Language VM For Skills Across Heterogenous Llms And Harnesses (2026)Le Chen, Erhu Feng, Yubin Xia, et al.2.00
- PPO Guided Agentic Pipeline For Adaptive Prompt Selection And Test Case Generation (2026)Gourisetty Venkata Sai Koushik, Dama Aditya, Mahankali Harish Sai, et al.2.00
- Quality-driven Agentic Reasoning For Llm-assisted Software Design: Questions-of-thoughts (qot) As A Time-series Self-qa Chain (2026)Yen-Ku Liu, Yun-Cheng Tsai2.00
- Localv: Exploiting Information Locality For Ip-level Verilog Generation (2026)Hanqi Lyu, di Huang, Yaoyu Zhu, et al.2.00
- Composer 2 Technical Report (2026)Cursor Research, :, Aaron Chan, et al.2.00
- Semia: Auditing Agent Skills Via Constraint-guided Representation Synthesis (2026)Hongbo Wen, Ying Li, Hanzhi Liu, et al.2.00
- HCAG: Hierarchical Abstraction And Retrieval-augmented Generation On Theoretical Repositories With Llms (2026)Yusen Wu, Xiaotie Deng2.00
- AHASD: Asynchronous Heterogeneous Architecture For LLM Adaptive Drafting Speculative Decoding On Mobile Devices (2026)Ma Zirui, Fan Zhihua, Li Wenxing, et al.2.00
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging (2026)Shiyang Li et al.1.94
- From Agent Loops to Structured Graphs:A Scheduler-Theoretic Framework for LLM Agent Execution (2026)HU Wei1.89
- Moded Types for Grassroots Logic Programs, by AI, for AI (Full Version) (2026)Ehud Shapiro1.72
- High-quality generation of dynamic game content via small language models: A proof of concept (2026)Morten I. K. Munk et al.1.72
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation (2025)Le Chen et al.1.67
- Generating Structured Plan Representation Of Procedures With Llms (2025)Deepeka Garg, Sihan Zeng, Sumitra Ganesh, et al.1.33
- From Natural Language To Solver-ready Power System Optimization: An Llm-assisted, Validation-in-the-loop Framework (2025)Yunkai Hu, Tianqiao Zhao, Meng Yue1.33
- Small Language Models For Agentic Systems: A Survey Of Architectures, Capabilities, And Deployment Trade Offs (2025)Raghav Sharma, Manan Mehta1.33
- Hypergraphos: A Meta Operating System For Science And Engineering (2024)Antonello Ceravola, Frank Joublin, Ahmed R. Sadik, et al.0.00
- Typefly: Flying Drones With Large Language Model (2023)Guojun Chen, Xiaojing Yu, Neiwen Ling, et al.0.00
- APPL: A Prompt Programming Language For Harmonious Integration Of Programs And Large Language Model Prompts (2024)Honghua Dong, Qidong Su, Yubo Gao, et al.0.00
- Hammer: Robust Function-calling For On-device Language Models Via Function Masking (2024)Qiqiang Lin, Muning Wen, Qiuying Peng, et al.0.00
- Askit: Unified Programming Interface For Programming With Large Language Models (2023)Katsumi Okuda, Saman Amarasinghe0.00
- CELI: Controller-embedded Language Model Interactions (2024)Jan-Samuel Wagner, Dave Decaprio, Abishek Chiffon Muthu Raja, et al.0.00
- Stateflow: Enhancing LLM Task-solving Through State-driven Workflows (2024)Yiran Wu, Tianwei Yue, Shaokun Zhang, et al.0.00