Proceedings of the 2nd International Conference on Information, Control and Automation, ICICA 2022, December 2-4, 2022, Chongqing, China

Research Article

Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology

Download165 downloads
  • @INPROCEEDINGS{10.4108/eai.2-12-2022.2328007,
        author={Lei  Yao and Yuchuan  Bian and Xiaoming  Ji and Mingyu  Guo},
        title={Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology},
        proceedings={Proceedings of the 2nd International Conference on Information, Control and Automation, ICICA 2022, December 2-4, 2022, Chongqing, China},
        publisher={EAI},
        proceedings_a={ICICA},
        year={2023},
        month={3},
        keywords={big data technology; attendance data; regression analysis; decision tree},
        doi={10.4108/eai.2-12-2022.2328007}
    }
    
  • Lei Yao
    Yuchuan Bian
    Xiaoming Ji
    Mingyu Guo
    Year: 2023
    Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology
    ICICA
    EAI
    DOI: 10.4108/eai.2-12-2022.2328007
Lei Yao1,*, Yuchuan Bian1, Xiaoming Ji1, Mingyu Guo1
  • 1: Beijing Institute of Spacecraft Environment Engineering
*Contact email: ylcasc@163.com

Abstract

The traditional database technology can hardly complete full data analysis on mass accumulated attendance data for the enterprise over the years, while big data technology is regarded as a better way to achieve it. After the study on a data-driven attendance management system, this paper proposed a decision tree regression analysis method based on the Spark platform, analyzing mass attendance data in a short time and generating a decision tree to show what affects employee attendance. The decision tree has certain reference significance for the monitoring and warning of employee attendance and employee management decision. This paper detailed the process of data-parallel processing and the regression analysis method of employee attendance by the Classification and Regression Tree (CART) algorithm and compared the efficiency of traditional database technology and parallel data processing for employee attendance data analysis, which verified the effectiveness of the method.