3rd International ICST Conference on Scalable Information Systems

Research Article

Approximate similarity search in metric spaces using inverted files

Download710 downloads
  • @INPROCEEDINGS{10.4108/ICST.INFOSCALE2008.3486,
        author={Giuseppe  Amato and Pasquale  Savino},
        title={Approximate similarity search in metric spaces using inverted files},
        proceedings={3rd International ICST Conference on Scalable Information Systems},
        publisher={ICST},
        proceedings_a={INFOSCALE},
        year={2010},
        month={5},
        keywords={},
        doi={10.4108/ICST.INFOSCALE2008.3486}
    }
    
  • Giuseppe Amato
    Pasquale Savino
    Year: 2010
    Approximate similarity search in metric spaces using inverted files
    INFOSCALE
    ICST
    DOI: 10.4108/ICST.INFOSCALE2008.3486
Giuseppe Amato1,*, Pasquale Savino1,*
  • 1: ISTI-CNR, Via G. Moruzzi, 1, 56124, Pisa, Italy
*Contact email: giuseppe.amato@isti.cnr.it, pasquale.savino@isti.cnr.it

Abstract

We propose a new approach to perform approximate similarity search in metric spaces. The idea at the basis of this technique is that when two objects are very close one to each other they 'see' the world around them in the same way. Accordingly, we can use a measure of dissimilarity between the view of the world, from the perspective of the two objects, in place of the distance function of the underlying metric space. To exploit this idea we represent each object of a dataset by the ordering of a number of reference objects of the metric space according to their distance from the object itself. In order to compare two objects of the dataset we compare the two corresponding orderings of the reference objects. We show that efficient and effective approximate similarity searching can be obtained by using inverted files, relying on this idea. We show that the proposed approach performs better than other approaches in literature.