About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Intelligent Systems and Machine Learning. First EAI International Conference, ICISML 2022, Hyderabad, India, December 16-17, 2022, Proceedings, Part II

Research Article

A Novel Oversampling Technique for Imbalanced Credit Scoring Datasets

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-35081-8_12,
        author={Sudhansu Ranjan Lenka and Sukant Kishoro Bisoy and Rojalina Priyadarshini and Jhalak Hota},
        title={A Novel Oversampling Technique for Imbalanced Credit Scoring Datasets},
        proceedings={Intelligent Systems and Machine Learning. First EAI International Conference, ICISML 2022, Hyderabad, India, December 16-17, 2022, Proceedings, Part II},
        proceedings_a={ICISML PART 2},
        year={2023},
        month={7},
        keywords={Credit Scoring Imbalance Class Distribution Noisy Samples Oversampling Classification},
        doi={10.1007/978-3-031-35081-8_12}
    }
    
  • Sudhansu Ranjan Lenka
    Sukant Kishoro Bisoy
    Rojalina Priyadarshini
    Jhalak Hota
    Year: 2023
    A Novel Oversampling Technique for Imbalanced Credit Scoring Datasets
    ICISML PART 2
    Springer
    DOI: 10.1007/978-3-031-35081-8_12
Sudhansu Ranjan Lenka1,*, Sukant Kishoro Bisoy1, Rojalina Priyadarshini1, Jhalak Hota
  • 1: Computer Science and Engineering
*Contact email: sudhansulenka2000@gmail.com

Abstract

The imbalanced class distribution of credit-scoring datasets typically makes the learning algorithms ineffective. In this study, NOSTE is proposed, a novel oversampling technique. It first identifies the informative minority instances by eliminating the noisy samples from the minority subset. Then, weight is assigned to the informative minority instances by considering the density and distance factors. Finally, new minority instances are created by determining the average of two different minority instances to make the dataset balanced. In the experimental study, NOSTE performance is validated by conducting an extensive comparison with four popular oversampling methods using three credit-scoring datasets from the UCI repository. The results confirmed that the proposed method brings significant improvement in the classification in terms of F-measure and AUC (Area under the Curve).

Keywords
Credit Scoring Imbalance Class Distribution Noisy Samples Oversampling Classification
Published
2023-07-10
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-35081-8_12
Copyright © 2022–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL