OCR
Status: Emerging
4 papers using it
First seen: 2025
Papers using OCR (4)
- Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs
- Inverse-LLaVA: Eliminating Alignment Pre-training Through Text-to-Vision Mapping
- Investigating Redundancy In Multimodal Large Language Models With Multiple Vision Encoders
- Optmerge: Unifying Multimodal LLM Capabilities And Modalities Via Model Merging