el 15(7): e4

Research Article

MSER Based Text Localization for Multi-language Using Double-Threshold Scheme

Download254 downloads
  • @ARTICLE{10.4108/icst.iniscom.2015.258413,
        author={Chayut Wiwatcharakoses and Karn Patanukhom},
        title={MSER Based Text Localization for Multi-language Using Double-Threshold Scheme},
        journal={EAI Endorsed Transactions on e-Learning},
        volume={2},
        number={7},
        publisher={EAI},
        journal_a={EL},
        year={2015},
        month={4},
        keywords={text localization, mser, double threshold, cascade classifier},
        doi={10.4108/icst.iniscom.2015.258413}
    }
    
  • Chayut Wiwatcharakoses
    Karn Patanukhom
    Year: 2015
    MSER Based Text Localization for Multi-language Using Double-Threshold Scheme
    EL
    EAI
    DOI: 10.4108/icst.iniscom.2015.258413
Chayut Wiwatcharakoses1, Karn Patanukhom1,*
  • 1: Chiang Mai University
*Contact email: karn@eng.cmu.ac.th

Abstract

In this paper, a region-based text localization that is robust for multiple languages is presented. Maximally Stable Extremal Regions (MSERs) are used for detecting candidates of text areas. The MSER components are grouped based on their connectivity in a feature space by using a new proposed rule for assigning the connectivity. The groups of components are classified into three classes that are text regions with high confidence, text region with low confidence, and non-text regions. A chain of text attribute constraint decision with the double-threshold scheme is developed to identify text regions. A sequence of constraint decision is designed to minimize the complexity based on short-circuit evaluation of logic operators. The regions that satisfy all strong constraints will be considered as text regions with high confidence while the regions that fail in some strong constraints but satisfy all weak constraints will be considered as text regions with low confidence. The final text regions are obtained from all text regions with high confidence and text regions with low confidence that have connectivity to text regions with high confidence. The proposed scheme is evaluated by using the natural scene images that consist of totally nine languages with different text alignments and camera views. The experiment shows that our proposed scheme can provide the satisfy results in comparison with baseline method.