About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Security and Privacy in New Computing Environments. 6th International Conference, SPNCE 2023, Guangzhou, China, November 25–26, 2023, Proceedings

Research Article

Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-73699-5_22,
        author={Yifang Chen and Guanchen Wen and Yong Wang and Jianhua Yang and Yu Zhang},
        title={Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module},
        proceedings={Security and Privacy in New Computing Environments. 6th International Conference, SPNCE 2023, Guangzhou, China, November 25--26, 2023, Proceedings},
        proceedings_a={SPNCE},
        year={2025},
        month={1},
        keywords={Computer-generated images Vision Transformer Robustness Generalization},
        doi={10.1007/978-3-031-73699-5_22}
    }
    
  • Yifang Chen
    Guanchen Wen
    Yong Wang
    Jianhua Yang
    Yu Zhang
    Year: 2025
    Computer-Generated Image Forensics Based on Vision Transformer with Forensic Feature Pre-processing Module
    SPNCE
    Springer
    DOI: 10.1007/978-3-031-73699-5_22
Yifang Chen1, Guanchen Wen1, Yong Wang1, Jianhua Yang1,*, Yu Zhang1
  • 1: Guangdong Polytechnic Normal University, GuangZhou
*Contact email: yangjh86@gpnu.edu.cn

Abstract

The correct distinction between highly realistic computer-generated (CG) images and photographic (PG) images has become an important area of research. In recent years, most of the CG image forensics methods are proposed based on deep learning, but the detection performances of these methods still need to be improved, especially in terms of robustness and generalization. To tackle these issues, we leverage theVision Transformer(ViT) model, which excels in capturing the global features of images, and design a Forensic Feature Pre-processing (FFP) module to further improve the detection performance. Experiments are conducted on a large-scale CG image benchmark (LSCGB), which is a challenging dataset for CG image detection. The proposed approach can achieve high detection accuracy. Extensive experiments on different public datasets and common post-processing operations demonstrate our approach can achieve significantly better generalization and robustness than the state-of-the-art approaches.

Keywords
Computer-generated images Vision Transformer Robustness Generalization
Published
2025-01-01
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-73699-5_22
Copyright © 2023–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL