About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Nature of Computation and Communication. 9th EAI International Conference, ICTCC 2023, Ho Chi Minh City, Vietnam, October 26-27, 2023, Proceedings

Research Article

An Approach for Object Recognition in Videos for Vocabulary Extraction

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-031-59462-5_3,
        author={Anh Bao Nguyen Le and Chi Bao Nguyen and Quoc Cuong Dang and Be Hai Danh and Huynh Nhu Le and Huong Hoang Luong and Hai Thanh Nguyen},
        title={An Approach for Object Recognition in Videos for Vocabulary Extraction},
        proceedings={Nature of Computation and Communication. 9th EAI International Conference, ICTCC 2023, Ho Chi Minh City, Vietnam, October 26-27, 2023, Proceedings},
        proceedings_a={ICTCC},
        year={2024},
        month={5},
        keywords={Vocabulary learning English object detection},
        doi={10.1007/978-3-031-59462-5_3}
    }
    
  • Anh Bao Nguyen Le
    Chi Bao Nguyen
    Quoc Cuong Dang
    Be Hai Danh
    Huynh Nhu Le
    Huong Hoang Luong
    Hai Thanh Nguyen
    Year: 2024
    An Approach for Object Recognition in Videos for Vocabulary Extraction
    ICTCC
    Springer
    DOI: 10.1007/978-3-031-59462-5_3
Anh Bao Nguyen Le1, Chi Bao Nguyen1, Quoc Cuong Dang1, Be Hai Danh1, Huynh Nhu Le1, Huong Hoang Luong, Hai Thanh Nguyen1,*
  • 1: College of Information and Communication Technology
*Contact email: nthai.cit@ctu.edu.vn

Abstract

English is the most common language globally, and it is increasingly important. English has been compiled in most online documents, information, and contents. However, with a considerable vocabulary, learning English is difficult for many people to remember. Therefore, many modern technologies have been proposed to support English learning, such as English learning technology through word-matching games to help children become excited and easily approach English from an early age. In addition, translation tools can help users look up vocabularies, antonyms, synonyms, and examples. This study presents a method to support learning English via object detection in videos, images, or even live-stream videos in real-time using deep learning architectures such as You Look Only Once (YOLO) - one of the finest families of object detection models with state-of-the-art performances. The method to obtain an mAP is 55.6 with 17GFlops. The results are vocabulary, meaning, and making sentences with that. Our method has good accuracy in data of 2786 images belonging to 59 classes.

Keywords
Vocabulary learning English object detection
Published
2024-05-03
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-031-59462-5_3
Copyright © 2023–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL