Ji Zhang
13 papers ยท 9 citations
Most-cited papers
- X-CLIP: End-to-end Multi-grained Contrastive Learning For Video-text Retrieval2022 ยท 260 citations
- Mplug-docowl 1.5: Unified Structure Learning For Ocr-free Document Understanding2024 ยท 237 citations
- AMBER: An Llm-free Multi-dimensional Benchmark For Mllms Hallucination Evaluation2023 ยท 227 citations
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model2023 ยท 144 citations
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model2023 ยท 143 citations
- Mplug: Effective And Efficient Vision-language Learning By Cross-modal Skip-connections2022 ยท 141 citations
- From Global To Local: Multi-scale Out-of-distribution Detection2023 ยท 31 citations
- Small Llms Are Weak Tool Learners: A Multi-llm Agent2024 ยท 22 citations
- Isimloc: Visual Global Localization For Previously Unseen Environments With Simulated Images2022 ยท 20 citations
- Isimloc: Visual Global Localization For Previously Unseen Environments With Simulated Images2022 ยท 20 citations
- SHREC'22 Track: Sketch-based 3D Shape Retrieval In The Wild2022 ยท 13 citations
- Mibench: Evaluating Multimodal Large Language Models Over Multiple Images2024 ยท 10 citations
- Reliable Few-shot Learning Under Dual Noises2025 ยท 9 citations
- Reliable Few-shot Learning Under Dual Noises2025 ยท 9 citations
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models2023 ยท 8 citations
Topics