Awesome Papers
LLMsQuantumSimSearchAI4CodeAgentsCVRoboticsCyberAI4SciSpeechRLMMGenAIGraphTSRecSysFL

← authors · overview

Yuqing Yang

20 papers · 1568 citations
Most-cited papers
  • Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression
    2023 · 407 citations
  • Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention
    2024 · 309 citations
  • Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models
    2023 · 227 citations
  • Parrot: Efficient Serving Of Llm-based Applications With Semantic Variable
    2024 · 105 citations
  • Retrievalattention: Accelerating Long-context LLM Inference Via Vector Retrieval
    2024 · 1 citations
Topics
EfficiencyModel ArchitectureIn-Context LearningPromptingAgenticImage Retrieval

Privacy · Terms

© 2026 Awesome Papers.