3rd International ICST Workshop on the Value of Security through Collaboration

Research Article

An Entropy based method for Measuring Anonymity

  • @INPROCEEDINGS{10.1109/SECCOM.2007.4550303,
        author={Michele Bezzi},
        title={An Entropy based method for Measuring Anonymity},
        proceedings={3rd International ICST Workshop on the Value of Security through Collaboration},
        publisher={IEEE},
        proceedings_a={SECOVAL},
        year={2008},
        month={6},
        keywords={Couplings  Databases  Entropy  Gaussian noise  Information resources  Loss measurement  Measurement uncertainty  Probability  Random processes  Testing},
        doi={10.1109/SECCOM.2007.4550303}
    }
    
  • Michele Bezzi
    Year: 2008
    An Entropy based method for Measuring Anonymity
    SECOVAL
    IEEE
    DOI: 10.1109/SECCOM.2007.4550303
Michele Bezzi1,*
  • 1: Accenture Technology Labs 449, route des Cretes Sophia Antipolis, France
*Contact email: michele.bezzi@accenture.com

Abstract

Data holders use data masking techniques for limiting disclosure risk in releasing sensitive datasets. Disclosure risk is often expressed in terms of rareness or of probability of re-identification. We propose a novel measure of disclosure risk, based on Shannon entropy, which combines together these two approaches. This measure represents the uncertainty of the linkage of the masked record with the original dataset, and so an estimation of the disclosure risk. It is also related to the size of the support of an equivalent random process with a uniform distribution. This allows us to define for any masking transformation an effective k value in analogy to k-anonymity measure used for integrity preserving transformations. Furthermore, this measure provides a direct link to the information loss in the transformations, providing some insights about the utility. We demonstrate this approach in a toy example using a dataset masked by adding Gaussian noise.