
Research Article
FDD-YOLO: A Lightweight Multi-scale Prohibited Items Detection Model
@ARTICLE{10.4108/airo.10277, author={Zilong Xue and Bo Wang and Yuanwei Xie and Zhibin Li and Xiaozheng Fan and Chenyoukang Lin and Peiyang Wei and Linlin Chen and Xun Deng and Jianhong Gan}, title={FDD-YOLO: A Lightweight Multi-scale Prohibited Items Detection Model}, journal={EAI Endorsed Transactions on AI and Robotics}, volume={4}, number={1}, publisher={EAI}, journal_a={AIRO}, year={2025}, month={11}, keywords={Frequency Domain Decomposition Network (FDDN), Deformable Elastic Fusion Pyramid (DEFP), Dual-channel Convolution (DualConv), Prohibited Items, X-ray Image}, doi={10.4108/airo.10277} }- Zilong Xue
Bo Wang
Yuanwei Xie
Zhibin Li
Xiaozheng Fan
Chenyoukang Lin
Peiyang Wei
Linlin Chen
Xun Deng
Jianhong Gan
Year: 2025
FDD-YOLO: A Lightweight Multi-scale Prohibited Items Detection Model
AIRO
EAI
DOI: 10.4108/airo.10277
Abstract
X-ray security inspection faces challenges such as severe occlusion, scale variation, and complex background when detecting prohibited items, requiring real-time and accurate detection. Although the YOLO series of models has high inference efficiency, they suffer from problems such as feature redundancy, insufficient fine-grained feature extraction, and limited adaptability to overlapping objects. To overcome these limitations, we propose FDD-YOLO and design three novel modules: (1) The Frequency Domain Decomposition Network (FDDN) in the backbone network enhances the edges of metal objects and the contours of liquid containers by decomposing high-frequency and low-frequency features while reducing computational redundancy; (2) The Deformable Elastic Fusion Pyramid (DEFP) in the neck network adopts dynamic channel allocation and multi-scale deformable convolution to handle the geometric changes of folded and overlapping objects; (3) The lightweight Dual-channel Convolution (DualConv) improves multi-scale feature capture through grouping and point-by-point convolution, thereby reducing the number of parameters while improving the accuracy of small object detection. Tests on the SIXray, HIXray, and private GIX datasets show that FDD-YOLO achieves 2.6%, 3.2%, and 8.6% higher mAP than YOLOv11n, respectively, achieving accuracies of 94.8%, 84%, and 71.8%, respectively. This framework also reduces the number of parameters by 30.6% and the number of FLOPs by 26.9%, achieving an optimal balance between accuracy and efficiency, setting a new technical benchmark for real-time security inspections.
Copyright © 2025 Zilong Xue et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.


