Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module

Yifang Chen; Guanchen Wen; Yong Wang; Jianhua Yang; Yu Zhang

Security and Privacy in New Computing Environments. 6th International Conference, SPNCE 2023, Guangzhou, China, November 25–26, 2023, Proceedings

Research Article

Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-031-73699-5_22,
    author={Yifang Chen and Guanchen Wen and Yong Wang and Jianhua Yang and Yu Zhang},
    title={Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module},
    proceedings={Security and Privacy in New Computing Environments. 6th International Conference, SPNCE 2023, Guangzhou, China, November 25--26, 2023, Proceedings},
    proceedings_a={SPNCE},
    year={2025},
    month={1},
    keywords={Computer-generated images Vision Transformer Robustness Generalization},
    doi={10.1007/978-3-031-73699-5_22}
}

Yifang Chen
Guanchen Wen
Yong Wang
Jianhua Yang
Yu Zhang
Year: 2025
Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module
SPNCE
Springer
DOI: 10.1007/978-3-031-73699-5_22

Yifang Chen¹, Guanchen Wen¹, Yong Wang¹, Jianhua Yang¹^,*, Yu Zhang¹

1: Guangdong Polytechnic Normal University, GuangZhou

*Contact email: yangjh86@gpnu.edu.cn

Abstract

The correct distinction between highly realistic computer-generated (CG) images and photographic (PG) images has become an important area of research. In recent years, most of the CG image forensics methods are proposed based on deep learning, but the detection performances of these methods still need to be improved, especially in terms of robustness and generalization. To tackle these issues, we leverage theVision Transformer(ViT) model, which excels in capturing the global features of images, and design a Forensic Feature Pre-processing (FFP) module to further improve the detection performance. Experiments are conducted on a large-scale CG image benchmark (LSCGB), which is a challenging dataset for CG image detection. The proposed approach can achieve high detection accuracy. Extensive experiments on different public datasets and common post-processing operations demonstrate our approach can achieve significantly better generalization and robustness than the state-of-the-art approaches.

Keywords: Computer-generated images, Vision Transformer, Robustness, Generalization

Published: 2025-01-01
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-031-73699-5_22

Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module

Abstract

About EAI

Community

Publish with EAI