Jan Kautz
11 papers · 0 citations
Most-cited papers
- Multimodal Unsupervised Image-to-image Translation · 2018 · 1582 citations
- VILA: On Pre-training For Visual Language Models · 2023 · 803 citations
- Few-shot Unsupervised Image-to-image Translation · 2019 · 465 citations
- FoundationPose: Unified 6D Pose Estimation And Tracking Of Novel Objects · 2023 · 238 citations
- Extreme View Synthesis · 2018 · 148 citations
- Compact Language Models Via Pruning And Knowledge Distillation · 2024 · 140 citations
- Eagle: Exploring The Design Space For Multimodal LLMs With Mixture Of Encoders · 2024 · 132 citations
- LITA: Language Instructed Temporal-localization Assistant · 2024 · 126 citations
- Contrastive Learning For Weakly Supervised Phrase Grounding · 2020 · 84 citations
- ToolOrchestra: Elevating Intelligence Via Efficient Model And Tool Orchestration · 2025
- Scaling RL To Long Videos · 2025
- AdaHuman: Animatable Detailed 3D Human Generation With Compositional Multiview Diffusion · 2025
- DreamGen: Unlocking Generalization In Robot Learning Through Video World Models · 2025
- GeoMan: Temporally Consistent Human Geometry Estimation Using Image-to-video Diffusion · 2025
- NitroGen: An Open Foundation Model For Generalist Gaming Agents · 2026
Topics