Research Article
Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology
@INPROCEEDINGS{10.4108/eai.2-12-2022.2328007, author={Lei Yao and Yuchuan Bian and Xiaoming Ji and Mingyu Guo}, title={Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology}, proceedings={Proceedings of the 2nd International Conference on Information, Control and Automation, ICICA 2022, December 2-4, 2022, Chongqing, China}, publisher={EAI}, proceedings_a={ICICA}, year={2023}, month={3}, keywords={big data technology; attendance data; regression analysis; decision tree}, doi={10.4108/eai.2-12-2022.2328007} }
- Lei Yao
Yuchuan Bian
Xiaoming Ji
Mingyu Guo
Year: 2023
Research and Regression Analysis of Enterprise Attendance Data Based on Big Data Technology
ICICA
EAI
DOI: 10.4108/eai.2-12-2022.2328007
Abstract
The traditional database technology can hardly complete full data analysis on mass accumulated attendance data for the enterprise over the years, while big data technology is regarded as a better way to achieve it. After the study on a data-driven attendance management system, this paper proposed a decision tree regression analysis method based on the Spark platform, analyzing mass attendance data in a short time and generating a decision tree to show what affects employee attendance. The decision tree has certain reference significance for the monitoring and warning of employee attendance and employee management decision. This paper detailed the process of data-parallel processing and the regression analysis method of employee attendance by the Classification and Regression Tree (CART) algorithm and compared the efficiency of traditional database technology and parallel data processing for employee attendance data analysis, which verified the effectiveness of the method.