Research Article
Glaucoma Classification using Light Vision Transformer
@ARTICLE{10.4108/eetpht.9.3931, author={Piyush Bhushan Singh and Pawan Singh and Harsh Dev and Anil Tiwari and Devanshu Batra and Brijesh Kumar Chaurasia}, title={Glaucoma Classification using Light Vision Transformer}, journal={EAI Endorsed Transactions on Pervasive Health and Technology}, volume={9}, number={1}, publisher={EAI}, journal_a={PHAT}, year={2023}, month={9}, keywords={Glaucoma, CNN models, Vision Transformer Model, Optimizers, Fundus imaging}, doi={10.4108/eetpht.9.3931} }
- Piyush Bhushan Singh
Pawan Singh
Harsh Dev
Anil Tiwari
Devanshu Batra
Brijesh Kumar Chaurasia
Year: 2023
Glaucoma Classification using Light Vision Transformer
PHAT
EAI
DOI: 10.4108/eetpht.9.3931
Abstract
INTRODUCTION: Nowadays one of the primary causes of permanent blindness is glaucoma. Due to the trade-offs, it makes in terms of portability, size, and cost, fundus imaging is the most widely used glaucoma screening technique. OBJECTIVES:To boost accuracy,focusing on less execution time, and less resources consumption, we have proposed a vision transformer-based model with data pre-processing techniques which fix classification problems. METHODS: Convolution is a “local” technique used by CNNs that is restricted to a limited area around an image. Self-attention, used by Vision Transformers, is a “global” action since it gathers data from the whole image. This makes it possible for the ViT to successfully collect far-off semantic relevance in an image. Several optimizers, including Adamax, SGD, RMSprop, Adadelta, Adafactor, Nadam, and Adagrad, were studied in this paper. We have trained and tested the Vision Transformer model on the IEEE Fundus image dataset having 1750 Healthy and Glaucoma images. Additionally, the dataset was preprocessed using image resizing, auto-rotation, and auto-adjust contrast by adaptive equalization. RESULTS: Results also show that the Nadam Optimizer increased accuracy up to 97% in adaptive equalized preprocessing dataset followed by auto rotate and image resizing operations. CONCLUSION: The experimental findings shows that transformer based classification spurred a revolution in computer vision with reduced time in training and classification.
Copyright © 2023 Singh et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.