Descriptor Transition Tables For Object Retrieval Using Unconstrained Cluttered Video Acquired Using A Consumer Level Handheld Mobile Device
2016 · Warren Rieutort-Louis, Ognjen Arandjelovic
Abstract
Visual recognition and vision-based retrieval of objects from large databases are tasks with a wide spectrum of potential applications. In this paper we propose a novel method for recognition from video sequences, suitable for retrieval from databases acquired in highly unconstrained conditions, e.g. using a mobile consumer-level device such as a phone. On the lowest level, we represent each sequence as a 3D mesh of densely packed local appearance descriptors. While image plane geometry is captured implicitly by a large overlap of the neighbouring regions from which the descriptors are extracted, 3D information is extracted by means of a descriptor transition table, learnt from a single sequence for each known gallery object. These tables allow us to connect local descriptors along the third dimension (which corresponds to viewpoint changes), resulting in a set of variable-length Markov chains for each video. The matching of two sets of such chains is formulated as a statistical hypothesis test, whe
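The idea of a descriptor transition table can be illustrated with a minimal sketch. The abstract does not give the descriptor type, the learning procedure, or the form of the hypothesis test, so everything below is an assumption made for illustration: descriptors are taken to be already quantised to integer codeword ids, the table is a first-order, row-normalised count matrix with additive smoothing, and a chain is scored by its log-likelihood. The names `learn_transition_table` and `chain_log_likelihood` are hypothetical and are not taken from the paper.

```python
import math


def learn_transition_table(sequence, n_words, smoothing=1.0):
    """Estimate a first-order transition table from one gallery sequence.

    sequence  : list of frames; each frame is a list of codeword ids, with
                the same local region at the same index in every frame
                (an assumption of this sketch).
    n_words   : number of distinct codewords (hypothetical vocabulary size).
    smoothing : additive count (must be > 0) so that unseen transitions
                keep a non-zero probability.
    """
    counts = [[smoothing] * n_words for _ in range(n_words)]
    # Count codeword-to-codeword transitions between consecutive frames.
    for prev_frame, next_frame in zip(sequence, sequence[1:]):
        for a, b in zip(prev_frame, next_frame):
            counts[a][b] += 1.0
    # Normalise each row so that it sums to one.
    table = []
    for row in counts:
        total = sum(row)
        table.append([c / total for c in row])
    return table


def chain_log_likelihood(chain, table):
    """Log-probability of a chain of codeword ids under the table."""
    return sum(math.log(table[a][b]) for a, b in zip(chain, chain[1:]))


# Toy gallery sequence: 3 frames, 2 tracked regions, 3 codewords.
gallery = [[0, 1], [1, 2], [2, 0]]
table = learn_transition_table(gallery, n_words=3)

seen_order = chain_log_likelihood([0, 1, 2], table)
reversed_order = chain_log_likelihood([2, 1, 0], table)
print(seen_order > reversed_order)
```

In this toy setting a chain that follows the transitions observed in the gallery sequence scores higher than the same codewords in reverse order. A comparison of such scores across gallery objects is one simple stand-in for the statistical test mentioned in the abstract, not a reconstruction of it.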
Related papers
- Search Tracker: Human-derived Object Tracking In-the-wild Through Large-scale Search And Retrieval (2016)
- Object-centric Framework For Video Moment Retrieval (2025)
- Exploiting Local Indexing And Deep Feature Confidence Scores For Fast Image-to-video Search (2018)
- Dynamic Gesture Retrieval: Searching Videos By Human Pose Sequence (2020)
- LOVO: Efficient Complex Object Query In Large-scale Video Datasets (2025)
- Visual Appearance Based Person Retrieval In Unconstrained Environment Videos (2019)
- HVD: Human Vision-driven Video Representation Learning For Text-video Retrieval (2026)
- Graph-based Non-linear Least Squares Optimization For Visual Place Recognition In Changing Environments (2020)