Jiayi Ji
12 papers · 2 citations
Most-cited papers
- Dual-level Collaborative Transformer For Image Captioning2021 · 277 citations
- Beat: Bi-directional One-to-many Embedding Alignment For Text-based Person Retrieval2024 · 27 citations
- Mihbench: Benchmarking And Mitigating Multi-image Hallucinations In Multimodal Large Language Models2025 · 1 citations
- Aigi-holmes: Towards Explainable And Generalizable Ai-generated Image Detection Via Multimodal Large Language Models2025 · 1 citations
- Evolving, Not Training: Zero-shot Reasoning Segmentation Via Evolutionary Prompting2025
- Mdreid: Modality-decoupled Learning For Any-to-any Multi-modal Object Re-identification2025
- Mdreid: Modality-decoupled Learning For Any-to-any Multi-modal Object Re-identification2025
- Space-10: A Comprehensive Benchmark For Multimodal Large Language Models In Compositional Spatial Intelligence2025
- Hieravid: Hierarchical Token Pruning For Fast Video Large Language Models2026
- Pixdlm: A Dual-path Multimodal Language Model For UAV Reasoning Segmentation2026
- Cir-cot: Towards Interpretable Composed Image Retrieval Via End-to-end Chain-of-thought Reasoning2025
- CSMCIR: Cot-enhanced Symmetric Alignment With Memory Bank For Composed Image Retrieval2026
- CSMCIR: Cot-enhanced Symmetric Alignment With Memory Bank For Composed Image Retrieval2026
- MVGGT: Multimodal Visual Geometry Grounded Transformer For Multiview 3D Referring Expression Segmentation2026
Topics