An Approach for Object Recognition in Videos for Vocabulary Extraction

Anh Bao Nguyen Le; Chi Bao Nguyen; Quoc Cuong Dang; Be Hai Danh; Huynh Nhu Le; Huong Hoang Luong; Hai Thanh Nguyen

Nature of Computation and Communication. 9th EAI International Conference, ICTCC 2023, Ho Chi Minh City, Vietnam, October 26-27, 2023, Proceedings

Research Article

Cite (BibTeX):

@INPROCEEDINGS{10.1007/978-3-031-59462-5_3,
    author={Anh Bao Nguyen Le and Chi Bao Nguyen and Quoc Cuong Dang and Be Hai Danh and Huynh Nhu Le and Huong Hoang Luong and Hai Thanh Nguyen},
    title={An Approach for Object Recognition in Videos for Vocabulary Extraction},
    proceedings={Nature of Computation and Communication. 9th EAI International Conference, ICTCC 2023, Ho Chi Minh City, Vietnam, October 26-27, 2023, Proceedings},
    proceedings_a={ICTCC},
    year={2024},
    month={5},
    keywords={Vocabulary, learning English, object detection},
    doi={10.1007/978-3-031-59462-5_3}
}

Anh Bao Nguyen Le, Chi Bao Nguyen, Quoc Cuong Dang, Be Hai Danh, Huynh Nhu Le, Huong Hoang Luong, Hai Thanh Nguyen. An Approach for Object Recognition in Videos for Vocabulary Extraction. ICTCC, Springer, 2024. DOI: 10.1007/978-3-031-59462-5_3

Anh Bao Nguyen Le¹, Chi Bao Nguyen¹, Quoc Cuong Dang¹, Be Hai Danh¹, Huynh Nhu Le¹, Huong Hoang Luong, Hai Thanh Nguyen¹,*

1: College of Information and Communication Technology

*Contact email: nthai.cit@ctu.edu.vn

Abstract

English is the most common language globally, and it is increasingly important. Most online documents, information, and content are written in English. However, its considerable vocabulary makes English difficult for many learners to remember. Many modern technologies have therefore been proposed to support English learning, such as word-matching games that help children become excited about English and approach it easily from an early age. In addition, translation tools help users look up vocabulary, antonyms, synonyms, and examples. This study presents a method that supports English learning through object detection in videos, images, and real-time live streams, using deep learning architectures such as You Only Look Once (YOLO), one of the finest families of object detection models with state-of-the-art performance. The method obtains an mAP of 55.6 at 17 GFLOPs. For each detected object, the output is the vocabulary word, its meaning, and example sentences that use it. Our method achieves good accuracy on a dataset of 2,786 images belonging to 59 classes.

Keywords: Vocabulary, learning English, object detection
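
The step from detected objects to vocabulary output, as the abstract describes it, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the per-frame input format of (label, confidence) pairs, the `GLOSSARY` contents, the `VocabularyEntry` type, and the `extract_vocabulary` helper are hypothetical and are not taken from the paper. The detector itself (for example a YOLO model) is left out, so the sketch only covers how labels might be turned into a word, a meaning, and an example sentence.

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical glossary: class label -> (meaning, example sentence).
# The paper's actual 59 classes and wording are not given in the abstract.
GLOSSARY = {
    "cup": ("a small container for drinking", "She drinks tea from a cup."),
    "dog": ("a common domestic animal", "The dog is sleeping on the sofa."),
}


@dataclass
class VocabularyEntry:
    word: str
    meaning: str
    sentence: str
    frames_seen: int


def extract_vocabulary(frame_detections, min_confidence=0.5):
    """Turn per-frame (label, confidence) detections into vocabulary entries.

    frame_detections is a list of frames, each a list of (label, confidence)
    tuples, in the shape an object detector might produce (assumed format).
    """
    counts = Counter()
    for detections in frame_detections:
        # Keep confident detections and count each label once per frame.
        labels = {label for label, conf in detections if conf >= min_confidence}
        counts.update(labels)

    entries = []
    # Most frequently seen objects come first.
    for word, n in counts.most_common():
        if word in GLOSSARY:  # skip labels that have no glossary entry
            meaning, sentence = GLOSSARY[word]
            entries.append(VocabularyEntry(word, meaning, sentence, n))
    return entries


if __name__ == "__main__":
    frames = [
        [("cup", 0.91), ("dog", 0.40)],
        [("cup", 0.88), ("dog", 0.75)],
        [("cup", 0.93)],
    ]
    for entry in extract_vocabulary(frames):
        print(f"{entry.word}: {entry.meaning}. Example: {entry.sentence}")
```

Counting each label once per frame keeps a single crowded frame from dominating the ranking; a real system might instead track objects across frames.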

Published: 2024-05-03
Appears in: SpringerLink

http://dx.doi.org/10.1007/978-3-031-59462-5_3
