Scalable Information Systems. 4th International ICST Conference, INFOSCALE 2009, Hong Kong, June 10-11, 2009, Revised Selected Papers

Research Article

Chemical Compounds with Path Frequency Using Multi-Core Technology

Download
430 downloads
  • @INPROCEEDINGS{10.1007/978-3-642-10485-5_19,
        author={Kun-Ming Yu and Yi-Yan Chang and Jiayi Zhou and Chun-Yuan Huang and Whei-meih Chang and Chun-Yuan Lin and Chuan Tang},
        title={Chemical Compounds with Path Frequency Using Multi-Core Technology},
        proceedings={Scalable Information Systems. 4th International ICST Conference, INFOSCALE 2009, Hong Kong, June 10-11, 2009, Revised Selected Papers},
        proceedings_a={INFOSCALE},
        year={2012},
        month={5},
        keywords={Chemical compound feature space Multi-Core Processing Branch-and-Bound OpenMP},
        doi={10.1007/978-3-642-10485-5_19}
    }
    
  • Kun-Ming Yu
    Yi-Yan Chang
    Jiayi Zhou
    Chun-Yuan Huang
    Whei-meih Chang
    Chun-Yuan Lin
    Chuan Tang
    Year: 2012
    Chemical Compounds with Path Frequency Using Multi-Core Technology
    INFOSCALE
    Springer
    DOI: 10.1007/978-3-642-10485-5_19
Kun-Ming Yu1,*, Yi-Yan Chang1,*, Jiayi Zhou1,*, Chun-Yuan Huang1,*, Whei-meih Chang1,*, Chun-Yuan Lin2,*, Chuan Tang3,*
  • 1: Chung Hua University
  • 2: Chang Gung University
  • 3: National Tsing Hua University
*Contact email: yu@chu.edu.tw, frank38@pdlab.csie.chu.edu.tw, jyzhou@pdlab.csie.chu.edu.tw, chunyuan.huang@chu.edu.tw, wmchang@chu.edu.tw, cyulin@mail.cgu.edu.tw, cytang@cs.nthu.edu.tw

Abstract

Drug design is the approach of finding drugs by design using computational tools. When designing a new drug, the structure of the drug molecule can be modeled by classification of potential chemical compounds. Kernel Methods have been successfully used in classifying chemical compounds, within which the most popular one is Support Vector Machine (SVM). In order to classify the characteristics of chemical compounds, methods such as frequency of labeled paths have been proposed to map compounds into feature vectors. In this study, we analyze the path frequencies computed from chemical compounds, and reconstruct all possible compounds that share the same path frequency with the original ones, but differ in their molecular structures. Since the computation time for reconstructing such compounds increase greatly along with the size increase of the compounds, we propose an efficient algorithm based on multi-core processing technology. We report here that our algorithm can infer chemical compounds from path frequency while effectively reduce computation time and obtained high speed up.