
Research Article
An Approach for Object Recognition in Videos for Vocabulary Extraction
@INPROCEEDINGS{10.1007/978-3-031-59462-5_3, author={Anh Bao Nguyen Le and Chi Bao Nguyen and Quoc Cuong Dang and Be Hai Danh and Huynh Nhu Le and Huong Hoang Luong and Hai Thanh Nguyen}, title={An Approach for Object Recognition in Videos for Vocabulary Extraction}, proceedings={Nature of Computation and Communication. 9th EAI International Conference, ICTCC 2023, Ho Chi Minh City, Vietnam, October 26-27, 2023, Proceedings}, proceedings_a={ICTCC}, year={2024}, month={5}, keywords={Vocabulary learning English object detection}, doi={10.1007/978-3-031-59462-5_3} }
- Anh Bao Nguyen Le
Chi Bao Nguyen
Quoc Cuong Dang
Be Hai Danh
Huynh Nhu Le
Huong Hoang Luong
Hai Thanh Nguyen
Year: 2024
An Approach for Object Recognition in Videos for Vocabulary Extraction
ICTCC
Springer
DOI: 10.1007/978-3-031-59462-5_3
Abstract
English is the most common language globally, and it is increasingly important. English has been compiled in most online documents, information, and contents. However, with a considerable vocabulary, learning English is difficult for many people to remember. Therefore, many modern technologies have been proposed to support English learning, such as English learning technology through word-matching games to help children become excited and easily approach English from an early age. In addition, translation tools can help users look up vocabularies, antonyms, synonyms, and examples. This study presents a method to support learning English via object detection in videos, images, or even live-stream videos in real-time using deep learning architectures such as You Look Only Once (YOLO) - one of the finest families of object detection models with state-of-the-art performances. The method to obtain an mAP is 55.6 with 17GFlops. The results are vocabulary, meaning, and making sentences with that. Our method has good accuracy in data of 2786 images belonging to 59 classes.