About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Communications and Networking. 14th EAI International Conference, ChinaCom 2019, Shanghai, China, November 29 – December 1, 2019, Proceedings, Part II

Research Article

A Classifier Combining Local Distance Mean and Centroid for Imbalanced Datasets

Download(Requires a free EAI acccount)
2 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-41117-6_11,
        author={Yingying Zhao and Xingcheng Liu},
        title={A Classifier Combining Local Distance Mean and Centroid for Imbalanced Datasets},
        proceedings={Communications and Networking. 14th EAI International Conference, ChinaCom 2019, Shanghai, China, November 29 -- December 1, 2019, Proceedings, Part II},
        proceedings_a={CHINACOM PART 2},
        year={2020},
        month={2},
        keywords={K-Nearest Neighbor (KNN) Local distance mean Centroid Classes imbalance Classifier},
        doi={10.1007/978-3-030-41117-6_11}
    }
    
  • Yingying Zhao
    Xingcheng Liu
    Year: 2020
    A Classifier Combining Local Distance Mean and Centroid for Imbalanced Datasets
    CHINACOM PART 2
    Springer
    DOI: 10.1007/978-3-030-41117-6_11
Yingying Zhao1, Xingcheng Liu1,*
  • 1: School of Electronics and Information Technology, Sun Yat-sen University
*Contact email: isslxc@mail.sysu.edu.cn

Abstract

The K-Nearest Neighbor (KNN) algorithm is widely used in practical life because of its simplicity and easy understanding. However, the traditional KNN algorithm has some shortcomings. It only considers the number of samples of different classes in k neighbors, but ignores the distance and location distribution of the unknown sample relative to the k nearest training samples. Moreover, classes imbalance problem is always a challenge faced with the KNN algorithm. To solve the above problems, we propose an improved KNN classification method for classes imbalanced datasets based on local distance mean and centroid (LDMC-KNN) in this paper. In the proposed scheme, different numbers of nearest neighbor training samples are selected from each class, and the unknown sample is classified according to the distance and position of these nearest training samples. Experiments are performed on the UCI datasets. The results show that the proposed algorithm has strong competitiveness and is always far superior to KNN algorithm and its variants.

Keywords
K-Nearest Neighbor (KNN) Local distance mean Centroid Classes imbalance Classifier
Published
2020-02-27
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-41117-6_11
Copyright © 2019–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL