
Research Article
The Impact of Similarity Functions on Divisive Analysis Clustering of Tuberculosis Disease
@INPROCEEDINGS{10.4108/eai.16-9-2025.2361147, author={Yulita Molliq Rangkuti and Mansur AS and Agus Junaidi and Nurul Maulida Surbakti and Rizky Gunawan}, title={The Impact of Similarity Functions on Divisive Analysis Clustering of Tuberculosis Disease}, proceedings={Proceedings of the 7th International Conference on Innovation in Education, Science, and Culture, ICIESC 2025, 16 September 2025, Medan, Indonesia}, publisher={EAI}, proceedings_a={ICIESC}, year={2026}, month={3}, keywords={tuberculosis; diana; davies bouldin index; similarity function}, doi={10.4108/eai.16-9-2025.2361147} }- Yulita Molliq Rangkuti
Mansur AS
Agus Junaidi
Nurul Maulida Surbakti
Rizky Gunawan
Year: 2026
The Impact of Similarity Functions on Divisive Analysis Clustering of Tuberculosis Disease
ICIESC
EAI
DOI: 10.4108/eai.16-9-2025.2361147
Abstract
Tuberculosis is a bacterial infection that usually affects the lungs but can also affect other parts of the body. It is contagious and spreads through the air when an infected person coughs, sneezes or sings. North Sumatra ranks as one of the three Indonesian provinces that experience the highest rates of both occurrence and death. Tracking Instances of Tuberculosis is crucial for managing and averting the spread of the illness. The Divisive Analysis (DIANA) algorithm is frequently utilized to categorize Tuberculosis cases. DIANA operates as a clustering algorithm that organizes items into sets based on their similarities. The study emphasizes evaluating the effectiveness of various similarity functions. The dataset comprises factors such as mortality rates, infection rates, and recovery rates sourced from the North Sumatra Provincial Health Office and the Central Statistics Agency (BPS). The findings indicated the emergence of four clusters within North Sumatra Province. Furthermore, an assessment was performed employing the Davies Bouldin Index (DBI) to assess the quality of clustering. By comparing various distance metrics (Bray Curtis distance, Chebyshev distance, and Canberra distance), the lowest DBI score was reached with Chebyshev distance, yielding a value of 0.5121.


