Prototype-Based Sample Selection for Active Hashing
- 1 Chungnam National University, Korea
Abstract
Several hashing-based methods for Approximate Nearest Neighbors (ANN) search in a large data set have been proposed recently. In particular, semi-supervised hashing utilizes semantic similarity given for a small fraction of pairwise data samples and active hashing aims to improve the performance for ANN search by relying on an expert for the labeling of the most informative points. In this study, we present an active hashing method by prototype-based sample selection. Knowing semantic similarities between cluster prototypes can help extracting relations among the points in the corresponding clusters. For expert labeling, we select prototypes from clusters which do not contain any data points with labeled information so that all areas can be covered effectively. Experimental results demonstrate that the proposed active hashing method improves the performance for ANN search.
DOI: https://doi.org/10.3844/jcssp.2015.839.844
Copyright: © 2015 Cheong Hee Park. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,013 Views
- 2,083 Downloads
- 0 Citations
Download
Keywords
- Active Hashing
- Approximate Nearest Neighbors (ANN) Search
- Hierarchical Clustering
- Prototype-Based Sample Selection
- Semi-Supervised Hashing