Advanced Hybrid Information Processing. 4th EAI International Conference, ADHIP 2020, Binzhou, China, September 26-27, 2020, Proceedings, Part I

Research Article

Efficient Feature Selection Algorithm for High-Dimensional Non-equilibrium Big Data Set

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-67871-5_36,
        author={Shuang-cheng Jia and Feng-ping Yang},
        title={Efficient Feature Selection Algorithm for High-Dimensional Non-equilibrium Big Data Set},
        proceedings={Advanced Hybrid Information Processing. 4th EAI International Conference, ADHIP 2020, Binzhou, China, September 26-27, 2020, Proceedings, Part I},
        proceedings_a={ADHIP},
        year={2021},
        month={2},
        keywords={High dimensional data, Non-equilibrium feature, Granulation fusion, Feature selection},
        doi={10.1007/978-3-030-67871-5_36}
    }
    
  • Shuang-cheng Jia
    Feng-ping Yang
    Year: 2021
    Efficient Feature Selection Algorithm for High-Dimensional Non-equilibrium Big Data Set
    ADHIP
    Springer
    DOI: 10.1007/978-3-030-67871-5_36
Shuang-cheng Jia1,*, Feng-ping Yang1
  • 1: Alibaba Network Technology Co., Ltd.
*Contact email: xindine30@163.com

Abstract

When traditional algorithms are used for feature classification on high-dimensional, non-equilibrium (imbalanced) big data sets, feature selection tends to suffer from low accuracy and low recall. To address this, a feature selection algorithm based on granular fusion is designed. Using the regularization feature of the data, the original big data set is transformed into small-scale data subsets. On this basis, the feature selection function of each data granule is obtained. Finally, a weighted fusion of the resulting feature subsets is computed, which realizes feature classification of the high-dimensional non-equilibrium big data set. The experimental results show that the granular-fusion algorithm can perform feature selection with good recall on high-dimensional imbalanced data sets, and that its accuracy is higher than that of the traditional method, which indicates that the method is feasible and effective.
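The three stages named in the abstract (granulation into small subsets, per-granule feature scoring, weighted fusion) can be illustrated with a minimal sketch. The abstract does not specify the granulation rule, the scoring function, or the fusion weights, so everything below is an assumption chosen for illustration: round-robin partitioning, a class-mean-separation score, and weights proportional to granule size. The function names `split_into_granules`, `score_features`, and `fuse_and_select` are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def split_into_granules(rows, labels, n_granules):
    """Round-robin partition into small subsets (assumed stand-in for granulation)."""
    granules = [([], []) for _ in range(n_granules)]
    for i, (row, label) in enumerate(zip(rows, labels)):
        g_rows, g_labels = granules[i % n_granules]
        g_rows.append(row)
        g_labels.append(label)
    return granules


def score_features(rows, labels):
    """Assumed score per feature: |mean(pos) - mean(neg)| / (std of the column)."""
    scores = []
    for j in range(len(rows[0])):
        col = [r[j] for r in rows]
        pos = [r[j] for r, y in zip(rows, labels) if y == 1]
        neg = [r[j] for r, y in zip(rows, labels) if y == 0]
        if not pos or not neg:
            # A granule missing one class carries no separation information.
            scores.append(0.0)
            continue
        scores.append(abs(mean(pos) - mean(neg)) / (pstdev(col) + 1e-12))
    return scores


def fuse_and_select(rows, labels, n_granules, k):
    """Fuse per-granule scores with size-proportional weights; keep the top k features."""
    fused = [0.0] * len(rows[0])
    total_weight = 0.0
    for g_rows, g_labels in split_into_granules(rows, labels, n_granules):
        if not g_rows:
            continue
        weight = float(len(g_rows))  # assumed weighting: granule size
        for j, s in enumerate(score_features(g_rows, g_labels)):
            fused[j] += weight * s
        total_weight += weight
    fused = [s / total_weight for s in fused]
    ranked = sorted(range(len(fused)), key=lambda j: fused[j], reverse=True)
    return sorted(ranked[:k]), fused


# Toy imbalanced data: 3 positives, 9 negatives.
# Feature 0 separates the classes, feature 1 alternates, feature 2 is constant.
labels = [1, 1, 1] + [0] * 9
rows = [[5.0 if y == 1 else 0.0, float(i % 2), 1.0] for i, y in enumerate(labels)]
selected, fused_scores = fuse_and_select(rows, labels, n_granules=3, k=1)
```

In this toy setup the class-separating feature receives the highest fused score and the constant feature receives zero. A real imbalanced data set would likely need a stratified granulation so that every granule contains minority-class samples; the round-robin split here only works because the positives are placed first.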

Keywords
High dimensional data; Non-equilibrium feature; Granulation fusion; Feature selection
Published
2021-02-03
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-67871-5_36