Awesome Speech Audio

Hao Fei

14 papers · 5 citations
Most-cited papers
  • NExT-GPT: Any-to-any Multimodal LLM
    2023 · 780 citations
  • LL3DA: Visual Interactive Instruction Tuning For Omni-3d Understanding, Reasoning, And Planning
    2023 · 216 citations
  • Faithful Logical Reasoning Via Symbolic Chain-of-thought
    2024 · 156 citations
  • OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning And Understanding
    2024 · 150 citations
  • LayoutLLM-T2I: Eliciting Layout Guidance From LLM For Text-to-image Generation
    2023 · 141 citations
  • ViTCoT: Video-text Interleaved Chain-of-thought For Boosting Video Understanding In Large Language Models
    2025 · 4 citations
  • LEAF-Mamba: Local Emphatic And Adaptive Fusion State Space Model For RGB-D Salient Object Detection
    2025 · 1 citation
  • MCM-DPO: Multifaceted Cross-modal Direct Preference Optimization For Alt-text Generation
    2025
  • Visual Thoughts: A Unified Perspective Of Understanding Multimodal Chain-of-thought
    2025
  • SAMTok: Representing Any Mask With Two Words
    2026
Topics
Model Architecture · Training Techniques · Vision-Language · Vision-Language Models · Prompting · Visual QA & Reasoning · In-Context Learning · Fine-Tuning · Safety & Alignment · Video-Language


© 2026 Awesome Papers.