Self-supervised Video Hashing With Hierarchical Binary Auto-encoder
2018 Β· Jingkuan Song, Hanwang Zhang, Xiangpeng Li, et al.
Abstract
Existing video hash functions are built on three isolated stages: frame pooling, relaxed learning, and binarization, which have not adequately explored the temporal order of video frames in a joint binary optimization model, resulting in severe information loss. In this paper, we propose a novel unsupervised video hashing framework dubbed Self-Supervised Video Hashing (SSVH), that is able to capture the temporal nature of videos in an end-to-end learning-to-hash fashion. We specifically address two central problems: 1) how to design an encoder-decoder architecture to generate binary codes for videos; and 2) how to equip the binary codes with the ability of accurate video retrieval. We design a hierarchical binary autoencoder to model the temporal dependencies in videos with multiple granularities, and embed the videos into binary codes with less computations than the stacked architecture. Then, we encourage the binary codes to simultaneously reconstruct the visual content and neighborh
Authors
(none)
Tags
Stats
Related papers
- Autossvh: Exploring Automated Frame Sampling For Efficient Self-supervised Video Hashing (2025)7.92
- CHAIN: Exploring Global-local Spatio-temporal Information For Improved Self-supervised Video Hashing (2023)8.60
- Dual-stream Knowledge-preserving Hashing For Unsupervised Video Retrieval (2023)9.23
- Encode The Unseen: Predictive Video Hashing For Scalable Mid-stream Retrieval (2020)3.58
- Unsupervised Deep Hashing For Large-scale Visual Search (2016)9.59
- Deep Supervised Discrete Hashing (2017)0.00
- Auto-encoding Twin-bottleneck Hashing (2020)19.34
- Video Retrieval Based On Deep Convolutional Neural Network (2017)9.03