EgoSchema
Emerging7papers using it
2024first seen
EgoSchema is a benchmark dataset used to evaluate the performance of models in understanding and extracting relevant information from long videos.
Papers using EgoSchema (7)
- Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMsProgressive Video Condensation with MLLM Agent for Long-form Video UnderstandingReDiPrune: Relevance-Diversity Pre-Projection Token Pruning for Efficient Multimodal LLMsCASHEW: Stabilizing Multimodal Reasoning via Iterative Trajectory AggregationEgovlm: Policy Optimization For Egocentric Video UnderstandingVideoAgent: Long-form Video Understanding with Large Language Model as
AgentLanguage Repository for Long Video Understanding