Qi Liu
16 papers · 5 citations
Most-cited papers
- Soulchat: Improving Llms' Empathy, Listening, And Comfort Abilities Through Fine-tuning With Multi-turn Empathy Conversations2023 · 154 citations
- Docpedia: Unleashing The Power Of Large Multimodal Model In The Frequency Domain For Versatile Document Understanding2023 · 123 citations
- Chartx & Chartvlm: A Versatile Benchmark And Foundation Model For Complicated Chart Reasoning2024 · 122 citations
- Not All Experts Are Equal: Efficient Expert Pruning And Skipping For Mixture-of-experts Large Language Models2024 · 97 citations
- Tabpedia: Towards Comprehensive Visual Table Understanding With Concept Synergy2024 · 71 citations
- Timechat-online: 80% Visual Tokens Are Naturally Redundant In Streaming Videos2025 · 4 citations
- Daocc: 3D Object Detection Assisted Multi-sensor Fusion For 3D Occupancy Prediction2024 · 3 citations
- SAEN-BGS: Energy-efficient Spiking Autoencoder Network For Background Subtraction2025 · 3 citations
- WDMIR: Wavelet-driven Multimodal Intent Recognition2025 · 1 citations
- What-meets-where: Unified Learning Of Action And Contact Localization In Images2025 · 1 citations
- What-meets-where: Unified Learning Of Action And Contact Localization In Images2025 · 1 citations
- Os-sentinel: Towards Safety-enhanced Mobile GUI Agents Via Hybrid Validation In Realistic Workflows2025
- Perception-r1: Advancing Multimodal Reasoning Capabilities Of Mllms Via Visual Perception Reward2025
- Deepscan: A Training-free Framework For Visually Grounded Reasoning In Large Vision-language Models2026
- Verified Critical Step Optimization For LLM Agents2026
Topics