Awesome Similarity Search

← authors Β· overview

Zirui Wang

11 papers · 0 citations
Most-cited papers
  • Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models
    2022 · 2373 citations
  • Ferret: Refer And Ground Anything Anywhere At Any Granularity
    2023 · 503 citations
  • MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training
    2024 · 261 citations
  • MM1.5: Methods, Analysis & Insights From Multimodal LLM Fine-tuning
    2024 · 70 citations
  • TokenCompose: Text-to-image Diffusion With Token-level Supervision
    2023 · 39 citations
  • MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains
    2024
  • SMAC-Hard: Enabling Mixed Opponent Strategy Script And Self-play On SMAC
    2024
  • VEAttack: Downstream-agnostic Vision Encoder Attack Against Large Vision Language Models
    2025
  • Cue3D: Quantifying The Role Of Image Cues In Single-image 3D Generation
    2025
  • MANZANO: A Simple And Scalable Unified Multimodal Model With A Hybrid Vision Tokenizer
    2025
  • OpenVision 2: A Family Of Generative Pretrained Visual Encoders For Multimodal Learning
    2025
  • MCPMark: A Benchmark For Stress-testing Realistic And Comprehensive MCP Use
    2025
Topics
  • Vision-Language
  • Model Architecture
  • Training Techniques
  • Benchmarks
  • Vision-Language Models
  • RAG
  • Evaluation
  • Safety & Alignment
  • Survey Paper
  • Code


© 2026 Awesome Papers.