Zhenguo Li
15 papers · 776 citations
Most-cited papers
- Pixart-\sigma: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation2024 · 284 citations
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation2024 · 123 citations
- Pixart-\sigma: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation2024 · 120 citations
- Genartist: Multimodal LLM As An Agent For Unified Image Generation And Editing2024 · 119 citations
- Unitr: A Unified And Efficient Multi-modal Transformer For Bird's-eye-view Representation2023 · 79 citations
- Detclipv3: Towards Versatile Generative Open-vocabulary Object Detection2024 · 55 citations
- Gaining Wisdom From Setbacks: Aligning Large Language Models Via Mistake Analysis2023 · 47 citations
- Contranerf: Generalizable Neural Radiance Fields For Synthetic-to-real Novel View Synthesis Via Contrastive Learning2023 · 20 citations
- Metaaugment: Sample-aware Data Augmentation Policy Learning2020 · 16 citations
- Generative Negative Text Replay For Continual Vision-language Pretraining2022 · 13 citations
Topics