Wireless Internet. 10th International Conference, WiCON 2017, Tianjin, China, December 16-17, 2017, Proceedings

Research Article

Improving Multiple-Instance Learning via Disambiguation by Considering Generalization

  • @INPROCEEDINGS{10.1007/978-3-319-90802-1_37,
        author={Lu Zhao and Youjian Yu and Hao Chen and Liming Yuan},
        title={Improving Multiple-Instance Learning via Disambiguation by Considering Generalization},
        proceedings={Wireless Internet. 10th International Conference, WiCON 2017, Tianjin, China, December 16-17, 2017, Proceedings},
        proceedings_a={WICON},
        year={2018},
        month={5},
        keywords={Multiple-instance learning Disambiguation Generalization ability},
        doi={10.1007/978-3-319-90802-1_37}
    }
    
  • Lu Zhao
    Youjian Yu
    Hao Chen
    Liming Yuan
    Year: 2018
    Improving Multiple-Instance Learning via Disambiguation by Considering Generalization
    WICON
    Springer
    DOI: 10.1007/978-3-319-90802-1_37
Lu Zhao1,*, Youjian Yu1, Hao Chen1, Liming Yuan2
  • 1: Tianjin Chengjian University
  • 2: Tianjin University of Technology
*Contact email: zhaolu6892@163.com

Abstract

Multiple-instance learning (MIL) is a variant of the traditional supervised learning. In MIL training examples are bags of instances and labels are associated with bags rather than individual instances. The standard MIL assumption indicates that a bag is labeled positive if at least one of its instances is labeled positive, and otherwise labeled negative. However, many MIL problems do not satisfy this assumption but the more general one that the class of a bag is jointly determined by multiple instances of the bag. To solve such problems, the authors of MILD proposed an efficient disambiguation method to identify the most discriminative instances in training bags and then converted MIL to the standard supervised learning. Nevertheless, MILD does not consider the generalization ability of its disambiguation method, leading to inferior performance compared to other baselines. In this paper, we try to improve the performance of MILD by considering the discrimination of its disambiguation method on the validation set. We have performed extensive experiments on the drug activity prediction and region-based image categorization tasks. The experimental results demonstrate that MILD outperforms other similar MIL algorithms by taking into account the generalization capability of its disambiguation method.