Mohit Bansal
19 papers · 3 citations
Most-cited papers
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models · 2022 · 2373 citations
- Less Is More: ClipBERT For Video-and-language Learning Via Sparse Sampling · 2021 · 468 citations
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models · 2023 · 312 citations
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse LLMs · 2023 · 288 citations
- VL-Adapter: Parameter-efficient Transfer Learning For Vision-and-language Tasks · 2021 · 242 citations
- Fine-grained Image Captioning With CLIP Reward · 2022 · 53 citations
- Hierarchical Video-moment Retrieval And Step-captioning · 2023 · 46 citations
- Adapt: As-needed Decomposition And Planning With Language Models · 2023 · 20 citations
- Contrastive Region Guidance: Improving Grounding In Vision-language Models Without Training · 2024 · 12 citations
- Diagnostic Benchmark And Iterative Inpainting For Layout-guided Image Generation · 2023 · 6 citations
- The Amazon Nova Family Of Models: Technical Report And Model Card · 2025 · 2 citations
- Motion-grounded Video Reasoning: Understanding And Perceiving Motion At Pixel Level · 2024 · 2 citations
- Video-skill-cot: Skill-based Chain-of-thoughts For Domain-adaptive Video Reasoning · 2025 · 1 citation
- Glider: Global And Local Instruction-driven Expert Router · 2024 · 1 citation
- Posh: Using Scene Graphs To Guide LLMs-as-a-judge For Detailed Image Descriptions · 2025
Topics