Time Series Prediction with Preprocessing and Clustering

Haoxuan Sun; Shuai Lin; Lin Han; Jidong Feng; Mingxu Sun

IoT and Big Data Technologies for Health Care. Second EAI International Conference, IoTCare 2021, Virtual Event, October 18-19, 2021, Proceedings, Part II

Research Article

Time Series Prediction with Preprocessing and Clustering

Download

155 downloads

Cite: BibTeX Plain Text

@INPROCEEDINGS{10.1007/978-3-030-94182-6_27,
    author={Haoxuan Sun and Shuai Lin and Lin Han and Jidong Feng and Mingxu Sun},
    title={Time Series Prediction with Preprocessing and Clustering},
    proceedings={IoT and Big Data Technologies for Health Care. Second EAI International Conference, IoTCare 2021, Virtual Event, October 18-19, 2021, Proceedings, Part II},
    proceedings_a={IOTCARE PART 2},
    year={2022},
    month={6},
    keywords={Time series Kmeans clustering Similarity},
    doi={10.1007/978-3-030-94182-6_27}
}

Haoxuan Sun
Shuai Lin
Lin Han
Jidong Feng
Mingxu Sun
Year: 2022
Time Series Prediction with Preprocessing and Clustering
IOTCARE PART 2
Springer
DOI: 10.1007/978-3-030-94182-6_27

Haoxuan Sun¹, Shuai Lin², Lin Han¹^,*, Jidong Feng¹, Mingxu Sun¹

1: School of EE, University of Jinan
2: Shandong Non-metallic Materials Institute

*Contact email: 201921200627@mail.ujn.edu.cn

Abstract

This paper studies the similarity of time series, and studies the influence of weight on prediction results on the basis of clustering. We first introduce the practical significance and research purpose of the selected topic, summarizes the current research situation at home and abroad, and summarizes the research content of this paper. Second, we describe related concepts. Later, based on Dodger data set, we study the flow of total prediction data of time series. First of all, feature extraction of the data, pre-processing work, the original data generation time series. Then the data are processed and divided into training data and test data for the convenience of subsequent processing. Then the clustering algorithm was used to divide the time series into categories, and seven categories were divided according to the characteristics of one week time cycle. The average value of each category is calculated to replace the characteristics of the current category, and then the similarity is compared. Finally, the weight of each category is calculated by similarity degree, and then the data is predicted. MAE, R-squared, MAPE and other indicators were used to analyze and evaluate the forecast data.

Keywords: Time series, Kmeans clustering, Similarity

Published: 2022-06-18
Appears in: SpringerLink

: http://dx.doi.org/10.1007/978-3-030-94182-6_27

Time Series Prediction with Preprocessing and Clustering

Abstract

About EAI

Community

Publish with EAI