
Research Article
A Study on Android Malware Detection Using Machine Learning Algorithms
@INPROCEEDINGS{10.1007/978-3-031-35078-8_20, author={K. S. Ujjwal Reddy and S. Sibi Chakkaravarthy and M. Gopinath and Aditya Mitra}, title={A Study on Android Malware Detection Using Machine Learning Algorithms}, proceedings={Intelligent Systems and Machine Learning. First EAI International Conference, ICISML 2022, Hyderabad, India, December 16-17, 2022, Proceedings, Part I}, proceedings_a={ICISML}, year={2023}, month={7}, keywords={Android malware Malware detection Permissions Activities Smartphone protection Machine learning}, doi={10.1007/978-3-031-35078-8_20} }
- K. S. Ujjwal Reddy
S. Sibi Chakkaravarthy
M. Gopinath
Aditya Mitra
Year: 2023
A Study on Android Malware Detection Using Machine Learning Algorithms
ICISML
Springer
DOI: 10.1007/978-3-031-35078-8_20
Abstract
Today, Android has become the most popular operating system because of its salient features. As it is an open-source mobile OS, several developers are developing and publishing their android applications. On the other side, attackers are manipulating those applications in the form of malicious software (Malware) by leveraging the application or functional flow of android OS and those malwares create loss or leakage of confidential sensitive information. Though most anti-virus software affords defence against malware attacks, still the attacks are highly possible in the real time adversarial environment. In this paper, the machine learning-based detection method is designed by combining the features of application namely permission and activity which are obtained during the installation of apps. In our design, permissions and activities of each app are extracted making use of Androguard tool. Using this feature combination, malicious apps are classified as either benign or malicious. The advantage of this method is that there is no need for any dynamic analysis. In our experimentation, we used real-world app samples with 500 malware and 500 benign to train the algorithm for better performance. Based on the experimentation results, highest detection rate is attained by Random Forest (RF) with 95% of accuracy and lowest detection rate is obtained by K-Nearest Neighbors (KNN) with 79% of accuracy.