About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
IoT and Big Data Technologies for Health Care. Second EAI International Conference, IoTCare 2021, Virtual Event, October 18-19, 2021, Proceedings, Part II

Research Article

Time Series Prediction with Preprocessing and Clustering

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-94182-6_27,
        author={Haoxuan Sun and Shuai Lin and Lin Han and Jidong Feng and Mingxu Sun},
        title={Time Series Prediction with Preprocessing and Clustering},
        proceedings={IoT and Big Data Technologies for Health Care. Second EAI International Conference, IoTCare 2021, Virtual Event, October 18-19, 2021, Proceedings, Part II},
        proceedings_a={IOTCARE PART 2},
        year={2022},
        month={6},
        keywords={Time series Kmeans clustering Similarity},
        doi={10.1007/978-3-030-94182-6_27}
    }
    
  • Haoxuan Sun
    Shuai Lin
    Lin Han
    Jidong Feng
    Mingxu Sun
    Year: 2022
    Time Series Prediction with Preprocessing and Clustering
    IOTCARE PART 2
    Springer
    DOI: 10.1007/978-3-030-94182-6_27
Haoxuan Sun1, Shuai Lin2, Lin Han1,*, Jidong Feng1, Mingxu Sun1
  • 1: School of EE, University of Jinan
  • 2: Shandong Non-metallic Materials Institute
*Contact email: 201921200627@mail.ujn.edu.cn

Abstract

This paper studies the similarity of time series, and studies the influence of weight on prediction results on the basis of clustering. We first introduce the practical significance and research purpose of the selected topic, summarizes the current research situation at home and abroad, and summarizes the research content of this paper. Second, we describe related concepts. Later, based on Dodger data set, we study the flow of total prediction data of time series. First of all, feature extraction of the data, pre-processing work, the original data generation time series. Then the data are processed and divided into training data and test data for the convenience of subsequent processing. Then the clustering algorithm was used to divide the time series into categories, and seven categories were divided according to the characteristics of one week time cycle. The average value of each category is calculated to replace the characteristics of the current category, and then the similarity is compared. Finally, the weight of each category is calculated by similarity degree, and then the data is predicted. MAE, R-squared, MAPE and other indicators were used to analyze and evaluate the forecast data.

Keywords
Time series Kmeans clustering Similarity
Published
2022-06-18
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-94182-6_27
Copyright © 2021–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL