Awesome Speech Audio
📄Papers🧭Topics👥Authors🔥Trending🗺️Map🏆Leaderboards📚Packs🛠️Tools📝Blogs🤖Ask AI✉️Newsletter🚀Pro
+ Add Paper

← authors · overview

Chen Li

31 papers · 3 citations
Most-cited papers
  • Mplug-docowl 1.5: Unified Structure Learning For Ocr-free Document Understanding
    2024 · 237 citations
  • Making Llama SEE And Draw With SEED Tokenizer
    2023 · 203 citations
  • ST-LLM: Large Language Models Are Effective Temporal Learners
    2024 · 136 citations
  • FP8-LM: Training FP8 Large Language Models
    2023 · 76 citations
  • Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning
    2023 · 45 citations
  • Semantic Attention And Scale Complementary Network For Instance Segmentation In Remote Sensing Images
    2021 · 44 citations
  • Vila-mil: Dual-scale Vision-language Multiple Instance Learning For Whole Slide Image Classification
    2025 · 41 citations
  • Multi-view Attentive Contextualization For Multi-view 3D Object Detection
    2024 · 7 citations
  • Ghunerf: Generalizable Human Nerf From A Monocular Video
    2023 · 5 citations
  • Reconstructing Close Human Interaction With Appearance And Proxemics Reasoning
    2025 · 3 citations
  • Contourformer: Real-time Contour-based End-to-end Instance Segmentation Transformer
    2025 · 2 citations
  • Ppllava: Varied Video Sequence Understanding With Prompt Guidance
    2026
  • Faithscan: Model-driven Single-pass Hallucination Detection For Faithful Visual Question Answering
    2026
Topics
Vision-LanguageModel ArchitectureTraining Techniques3D VisionIn-Context LearningEfficiencySegmentationVisual QA & ReasoningVision-Language ModelsCode

Stay Updated

E-Mail Digest

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.