
Research Article
Metaheuristics-Based Hyperparameter Tuning for Convolutional Neural Networks
@INPROCEEDINGS{10.1007/978-3-031-55993-8_4, author={Tong Van Luyen and Nguyen Van Cuong}, title={Metaheuristics-Based Hyperparameter Tuning for Convolutional Neural Networks}, proceedings={Ad Hoc Networks. 14th EAI International Conference, AdHocNets 2023, Hanoi, Vietnam, November 10-11, 2023, Proceedings}, proceedings_a={ADHOCNETS}, year={2024}, month={3}, keywords={Hyperparameter optimization Convolutional neural networks Binary bat algorithm Metaheuristic algorithms Handwritten Chinese character classification}, doi={10.1007/978-3-031-55993-8_4} }
- Tong Van Luyen
Nguyen Van Cuong
Year: 2024
Metaheuristics-Based Hyperparameter Tuning for Convolutional Neural Networks
ADHOCNETS
Springer
DOI: 10.1007/978-3-031-55993-8_4
Abstract
Convolutional neural networks have made remarkable strides in the field of deep learning, achieving outstanding successes. However, to ensure the efficiency and high performance of these networks, it is crucial to optimize their hyperparameters. This paper presents a novel approach that focuses on optimizing hyperparameters for convolutional neural networks. The proposed approach leverages the binary bat algorithm, which is recognized as one of the most efficient algorithms among nature-inspired metaheuristic algorithms. By utilizing this approach, a set of optimal hyperparameters can be obtained, enabling the construction of convolutional neural network models that exhibit superior performance for specific applications. To demonstrate the effectiveness of this approach, the study employs it to determine hyperparameters such as the learning rate of optimizers and the number of filters in each convolutional layer. The objective is to build optimal models for the task of handwritten Chinese character classification. The empirical results obtained demonstrate the remarkable capabilities of the proposed approach. The models generated through this method exhibit higher performance in terms of classification accuracy and convergence ability when compared to the LeNet-5 model, as well as models based on Hyperband, Random Search, and Bayesian Optimization.