8th International Conference on Communications and Networking in China

Research Article

Hardware Acceleration of Barrier Communication for Large Scale Parallel Computer

  • @INPROCEEDINGS{10.1109/ChinaCom.2013.6694666,
        author={Zhengbin Pang and Shaogang Wang and Dan Wu and Pingjing Lu},
        title={Hardware Acceleration of Barrier Communication for Large Scale Parallel Computer},
        proceedings={8th International Conference on Communications and Networking in China},
        publisher={IEEE},
        proceedings_a={CHINACOM},
        year={2013},
        month={11},
        keywords={collective communication nic communication offload},
        doi={10.1109/ChinaCom.2013.6694666}
    }
    
  • Zhengbin Pang
    Shaogang Wang
    Dan Wu
    Pingjing Lu
    Year: 2013
    Hardware Acceleration of Barrier Communication for Large Scale Parallel Computer
    CHINACOM
    IEEE
    DOI: 10.1109/ChinaCom.2013.6694666
Zhengbin Pang1, Shaogang Wang1,*, Dan Wu1, Pingjing Lu1
  • 1: National University of Defense Technology
*Contact email: wshaogang79@gmail.com

Abstract

MPI collective communication overhead dominates the communication cost for large scale parallel computers, scalability and operation latency for collective communication is critical for next generation computers. This paper proposes a fast and scalable barrier communication offload approach which supports millions of compute cores. Following our approach, the barrier operation sequence is packed by host MPI driver into the barrier "descriptor", which is pushed to the NIC (Network-Interfaces). The NIC can complete the barrier automatically following its algorithm descriptor. Our approach accelerates both intra-node and inter-node barrier communication. We show that our approach achieves both barrier performance and scalability, especially for large scale computer system. This paper also proposes an extendable and easy-to-implement NIC architecture supporting barrier offload communication and also other communication pattern.