
Research Article
Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module
@INPROCEEDINGS{10.1007/978-3-031-73699-5_22,
  author={Yifang Chen and Guanchen Wen and Yong Wang and Jianhua Yang and Yu Zhang},
  title={Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module},
  proceedings={Security and Privacy in New Computing Environments. 6th International Conference, SPNCE 2023, Guangzhou, China, November 25--26, 2023, Proceedings},
  proceedings_a={SPNCE},
  year={2025},
  month={1},
  keywords={Computer-generated images, Vision Transformer, Robustness, Generalization},
  doi={10.1007/978-3-031-73699-5_22}
}
- Yifang Chen
- Guanchen Wen
- Yong Wang
- Jianhua Yang
- Yu Zhang
Year: 2025
SPNCE
Springer
DOI: 10.1007/978-3-031-73699-5_22
Abstract
Correctly distinguishing highly realistic computer-generated (CG) images from photographic (PG) images has become an important area of research. In recent years, most CG image forensics methods have been based on deep learning, but their detection performance still needs to be improved, especially in terms of robustness and generalization. To tackle these issues, we leverage the Vision Transformer (ViT) model, which excels at capturing the global features of images, and design a Forensic Feature Pre-processing (FFP) module to further improve detection performance. Experiments are conducted on a large-scale CG image benchmark (LSCGB), which is a challenging dataset for CG image detection. The proposed approach achieves high detection accuracy. Extensive experiments on different public datasets and under common post-processing operations demonstrate that our approach achieves significantly better generalization and robustness than state-of-the-art approaches.