Glaucoma Classification using Light Vision Transformer

Piyush Bhushan Singh; Pawan Singh; Harsh Dev; Anil Tiwari; Devanshu Batra; Brijesh Kumar Chaurasia

phat 24(1):

Research Article

Glaucoma Classification using Light Vision Transformer

Download399 downloads

Cite: BibTeX Plain Text

@ARTICLE{10.4108/eetpht.9.3931,
    author={Piyush Bhushan Singh and Pawan Singh and Harsh Dev and Anil Tiwari and Devanshu Batra and Brijesh Kumar Chaurasia},
    title={Glaucoma Classification using Light Vision Transformer},
    journal={EAI Endorsed Transactions on Pervasive Health and Technology},
    volume={9},
    number={1},
    publisher={EAI},
    journal_a={PHAT},
    year={2023},
    month={9},
    keywords={Glaucoma, CNN models, Vision Transformer Model, Optimizers, Fundus imaging},
    doi={10.4108/eetpht.9.3931}
}

Piyush Bhushan Singh
Pawan Singh
Harsh Dev
Anil Tiwari
Devanshu Batra
Brijesh Kumar Chaurasia
Year: 2023
Glaucoma Classification using Light Vision Transformer
PHAT
EAI
DOI: 10.4108/eetpht.9.3931

Piyush Bhushan Singh¹^,*, Pawan Singh¹, Harsh Dev², Anil Tiwari¹, Devanshu Batra², Brijesh Kumar Chaurasia²

1: Amity University
2: Pranveer Singh Institute of Technology

*Contact email: piyush.bhadauria@gmail.com

Abstract

INTRODUCTION: Nowadays one of the primary causes of permanent blindness is glaucoma. Due to the trade-offs, it makes in terms of portability, size, and cost, fundus imaging is the most widely used glaucoma screening technique. OBJECTIVES:To boost accuracy,focusing on less execution time, and less resources consumption, we have proposed a vision transformer-based model with data pre-processing techniques which fix classification problems. METHODS: Convolution is a “local” technique used by CNNs that is restricted to a limited area around an image. Self-attention, used by Vision Transformers, is a “global” action since it gathers data from the whole image. This makes it possible for the ViT to successfully collect far-off semantic relevance in an image. Several optimizers, including Adamax, SGD, RMSprop, Adadelta, Adafactor, Nadam, and Adagrad, were studied in this paper. We have trained and tested the Vision Transformer model on the IEEE Fundus image dataset having 1750 Healthy and Glaucoma images. Additionally, the dataset was preprocessed using image resizing, auto-rotation, and auto-adjust contrast by adaptive equalization. RESULTS: Results also show that the Nadam Optimizer increased accuracy up to 97% in adaptive equalized preprocessing dataset followed by auto rotate and image resizing operations. CONCLUSION: The experimental findings shows that transformer based classification spurred a revolution in computer vision with reduced time in training and classification.

Keywords: Glaucoma, CNN models, Vision Transformer Model, Optimizers, Fundus imaging

Received: 2023-07-11
Accepted: 2023-09-05
Published: 2023-09-21
Publisher: EAI

: http://dx.doi.org/10.4108/eetpht.9.3931

Copyright © 2023 Singh et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

Glaucoma Classification using Light Vision Transformer

Abstract

About EAI

Community

Publish with EAI