8th International Conference on Communications and Networking in China

Research Article

A Parallel Algorithm of Association Rules Based on Cloud Computing

  • @INPROCEEDINGS{10.1109/ChinaCom.2013.6694632,
        author={yong wang and Zhe Zhang and Fang Wang},
        title={A Parallel Algorithm of Association Rules Based on Cloud Computing},
        proceedings={8th International Conference on Communications and Networking in China},
        publisher={IEEE},
        proceedings_a={CHINACOM},
        year={2013},
        month={11},
        keywords={fp-growth parallel computing cloud computing mapreduce association rules},
        doi={10.1109/ChinaCom.2013.6694632}
    }
    
  • yong wang
    Zhe Zhang
    Fang Wang
    Year: 2013
    A Parallel Algorithm of Association Rules Based on Cloud Computing
    CHINACOM
    IEEE
    DOI: 10.1109/ChinaCom.2013.6694632
yong wang1,*, Zhe Zhang2, Fang Wang2
  • 1: guilin university of electronic technology
  • 2: Guilin University of Electronic Technology
*Contact email: 969055313@qq.com

Abstract

In view of the traditional parallel FP-growth algorithm (PFP)that suffers from two major limitations, namely, multiple database scans requirement (i.e., high I/O cost) and high inter-processor communications cost, therefore we design and implement a kind of parallel association rules mining method based on cloud computing. The algorithm adopts the separation strategy to simply visit a local database only once, thus reducing inter-processor communication I/O overhead. What’s more, the MapReduce model is used to solve the problem of huge amounts of data mining, as well as the calculated execution taking place in the local data storage node, which can avoid large amounts of data on the network transmission and reduce the communication overhead. By using ordinary PC structures, Hadoop cluster experimental results verify that the proposed algorithm based on cloud computing offers higher efficiency and has a good speedup.