About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Big Data Technologies and Applications. 11th and 12th EAI International Conference, BDTA 2021 and BDTA 2022, Virtual Event, December 2021 and 2022, Proceedings

Research Article

A Comparative Study for Anonymizing Datasets with Multiple Sensitive Attributes and Multiple Records

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-33614-0_3,
        author={Mona Mohamed Nasr and Hayam Mohamed Sayed and Waleed Mahmoud Ead},
        title={A Comparative Study for Anonymizing Datasets with Multiple Sensitive Attributes and Multiple Records},
        proceedings={Big Data Technologies and Applications. 11th and 12th EAI International Conference, BDTA 2021 and BDTA 2022, Virtual Event, December 2021 and 2022, Proceedings},
        proceedings_a={BDTA},
        year={2023},
        month={5},
        keywords={Privacy Anonymization Healthcare Data publishing MSA PPDP IOT},
        doi={10.1007/978-3-031-33614-0_3}
    }
    
  • Mona Mohamed Nasr
    Hayam Mohamed Sayed
    Waleed Mahmoud Ead
    Year: 2023
    A Comparative Study for Anonymizing Datasets with Multiple Sensitive Attributes and Multiple Records
    BDTA
    Springer
    DOI: 10.1007/978-3-031-33614-0_3
Mona Mohamed Nasr1, Hayam Mohamed Sayed1, Waleed Mahmoud Ead1,*
  • 1: Information Systems Department, Faculty of Computers and Artificial Intelligence
*Contact email: waleedead@fcis.bsu.edu.eg

Abstract

Today, there are many sources of data, such as IoT devices, that produce a massive amount of data, particularly in the healthcare industry. This microdata needs to be published, and shared for medical research purposes, data analysis, mining, learning analytics tasks, and the decision-making process. But this published data contains sensitive and private information for individuals, and if this microdata is published in its original format, the privacy of individuals may be disclosed, which puts the individuals at risk, especially if an adversary has strong background knowledge about the target individual. Owning multiple records and multiple sensitive attributes (MSA) for an individual can lead to new privacy leakages or disclosure. So, the fundamental issue is how to protect the privacy of 1:M with the MSA dataset using anonymization techniques and methods, as well as how to balance utility and privacy, for this data while reducing information loss and misuse. The objective of this paper is to use different methods and different anonymization algorithms, like the 1:m-generalization algorithm and Mondrian, and compare them to show which of them maintains data privacy and high utility of analysis results at the same time. From this comparison, we found that the m-generalization algorithm and the (p, k) angelization method perform well in terms of information loss and data utility compared to the other remaining methods and algorithms.

Keywords
Privacy Anonymization Healthcare Data publishing MSA PPDP IOT
Published
2023-05-26
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-33614-0_3
Copyright © 2021–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL