Research Article
A Parallel Algorithm of Association Rules Based on Cloud Computing
@INPROCEEDINGS{10.1109/ChinaCom.2013.6694632, author={yong wang and Zhe Zhang and Fang Wang}, title={A Parallel Algorithm of Association Rules Based on Cloud Computing}, proceedings={8th International Conference on Communications and Networking in China}, publisher={IEEE}, proceedings_a={CHINACOM}, year={2013}, month={11}, keywords={fp-growth parallel computing cloud computing mapreduce association rules}, doi={10.1109/ChinaCom.2013.6694632} }
- yong wang
Zhe Zhang
Fang Wang
Year: 2013
A Parallel Algorithm of Association Rules Based on Cloud Computing
CHINACOM
IEEE
DOI: 10.1109/ChinaCom.2013.6694632
Abstract
In view of the traditional parallel FP-growth algorithm (PFP)that suffers from two major limitations, namely, multiple database scans requirement (i.e., high I/O cost) and high inter-processor communications cost, therefore we design and implement a kind of parallel association rules mining method based on cloud computing. The algorithm adopts the separation strategy to simply visit a local database only once, thus reducing inter-processor communication I/O overhead. What’s more, the MapReduce model is used to solve the problem of huge amounts of data mining, as well as the calculated execution taking place in the local data storage node, which can avoid large amounts of data on the network transmission and reduce the communication overhead. By using ordinary PC structures, Hadoop cluster experimental results verify that the proposed algorithm based on cloud computing offers higher efficiency and has a good speedup.