LLaVA-1.5
Emerging5papers using it
2024first seen
Papers using LLaVA-1.5 (5)
- AdaIAT: Adaptively Increasing Attention to Generated Text to Alleviate Hallucinations in LVLMPrune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-DiversityConsensusDrop: Fusing Visual and Cross-Modal Saliency for Efficient Vision Language ModelsWatch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual ReasoningModel Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large
Language Models