VATEX
Emerging3papers using it
2023first seen
Papers using VATEX (3)
- ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video CaptioningGAIS: Frame-level Gated Audio-visual Integration With Semantic Variance-scaled Perturbation For Text-video RetrievalCL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual
Knowledge Transfer