Deepsketch: A New Machine Learning-based Reference Search Technique For Post-deduplication Delta Compression
2022 Β· Jisung Park, Jeoggyun Kim, Yeseong Kim, et al.
Abstract
Data reduction in storage systems is becoming increasingly important as an effective solution to minimize the management cost of a data center. To maximize data-reduction efficiency, existing post-deduplication delta-compression techniques perform delta compression along with traditional data deduplication and lossless compression. Unfortunately, we observe that existing techniques achieve significantly lower data-reduction ratios than the optimal due to their limited accuracy in identifying similar data blocks. In this paper, we propose DeepSketch, a new reference search technique for post-deduplication delta compression that leverages the learning-to-hash method to achieve higher accuracy in reference search for delta compression, thereby improving data-reduction efficiency. DeepSketch uses a deep neural network to extract a data block's sketch, i.e., to create an approximate data signature of the block that can preserve similarity with other blocks. Our evaluation using eleven rea
Authors
(none)
Tags
Stats
Related papers
- Sketchmate: Deep Hashing For Million-scale Human Sketch Retrieval (2018)15.03
- Sketchcleannet -- A Deep Learning Approach To The Enhancement And Correction Of Query Sketches For A 3D CAD Model Retrieval System (2022)9.03
- Deep Sketch Hashing: Fast Free-hand Sketch-based Image Retrieval (2017)17.49
- Sketch Down The Flops: Towards Efficient Networks For Human Sketch (2025)0.00
- Deepssn: A Deep Convolutional Neural Network To Assess Spatial Scene Similarity (2022)8.09
- 'cadsketchnet' -- An Annotated Sketch Dataset For 3D CAD Model Retrieval With Deep Neural Networks (2021)11.19
- Making Online Sketching Hashing Even Faster (2020)9.23
- Sketching Without Worrying: Noise-tolerant Sketch-based Image Retrieval (2022)12.74