Impact of Features Reduction on Machine Learning Based Intrusion Detection Systems

Masooma Fatima; Osama Rehman; Ibrahim M. H. Rahman

Research Article

Impact of Features Reduction on Machine Learning Based Intrusion Detection Systems

Download426 downloads

Cite: BibTeX Plain Text

@ARTICLE{10.4108/eetsis.vi.447,
    author={Masooma Fatima and Osama Rehman and Ibrahim M. H. Rahman},
    title={Impact of Features Reduction on Machine Learning Based Intrusion Detection Systems},
    journal={EAI Endorsed Transactions on Scalable Information Systems},
    volume={9},
    number={6},
    publisher={EAI},
    journal_a={SIS},
    year={2022},
    month={4},
    keywords={DDoS attacks, Random Forest, Na\~{n}ve Bayes, SVM, WEKA, IDS},
    doi={10.4108/eetsis.vi.447}
}

Masooma Fatima
Osama Rehman
Ibrahim M. H. Rahman
Year: 2022
Impact of Features Reduction on Machine Learning Based Intrusion Detection Systems
SIS
EAI
DOI: 10.4108/eetsis.vi.447

Masooma Fatima¹^,*, Osama Rehman², Ibrahim M. H. Rahman³

1: Systems Ltd, Karachi, Pakistan
2: Bahria University
3: Open Polytechnic

*Contact email: masoomafatima69@gmail.com

Abstract

INTRODUCTION: As the use of the internet is increasing rapidly, cyber-attacks over user’s personal data and network resources are on the rise. Due to the easily accessible cyber-attack tools, attacks on cyber resources are becoming common including Distributed Denial-of-Service (DDoS) attacks. Intruders are using enhanced techniques for executing DDoS attacks. OBJECTIVES: Machine Learning (ML) based classification modules integrated with Intrusion Detection System (IDS) has the potential to detect cyber-attacks. This research aims to study the performance of several machine learning algorithms, namely Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machine in classifying DDoS attacks from normal traffic. METHODS: The paper focuses on DDoS attacks identification for which multiclass dataset is being used including Smurf, SIDDoS, HTTP-Flood and UDP-Flood. balanced datasets are used for both training and testing purposes in order to obtain biased free results. four experimental scenarios are conducted in which each experiment contains a different set of reduced features. RESULTS: Result of each experiment is computed individually and the best algorithm among the four is highlighted by mean of its accuracy, detection rates and processing time required to build and test the classifiers. CONCLUSION: Based on all experimental results, it is found that Decision Tree algorithm has shown promising cumulative performances in terms of the metrics investigated.

Keywords: DDoS attacks, Random Forest, Naïve Bayes, SVM, WEKA, IDS

Received: 2022-02-03
Accepted: 2022-03-31
Published: 2022-04-13
Publisher: EAI

: http://dx.doi.org/10.4108/eetsis.vi.447

Copyright © 2022 Masooma Fatima et al., licensed to EAI. This is an open access article distributed under the terms of the Creative Commons Attribution license, which permits unlimited use, distribution and reproduction in any medium so long as the original work is properly cited.