系统仿真学报 ›› 2022, Vol. 34 ›› Issue (10): 2194-2203.doi: 10.16182/j.issn1004731x.joss.21-0456

• 仿真建模理论与方法 • 上一篇    下一篇

基于加权局部复杂不变性的时间序列分类算法

李怡桐1(), 刘晓涛1(), 刘静1, 吴凯2   

  1. 1.西安电子科技大学 广州研究院,广东 广州 510555
    2.西安电子科技大学 人工智能学院,陕西 西安 710071
  • 收稿日期:2021-05-21 修回日期:2021-09-01 出版日期:2022-10-30 发布日期:2022-10-18
  • 通讯作者: 刘晓涛 E-mail:958933601@qq.com;xtliu@xidian.edu.cn
  • 作者简介:李怡桐(1997-),女,蒙古族,硕士生,研究方向为时间序列数据挖掘、复杂网络系统认知、智能优化与学习。 E-mail:958933601@qq.com
  • 基金资助:
    国家自然科学基金(61773300);科技部科技创新2030 -“新一代人工智能”重大项目(2018AAA0101302)

Weighted Local Complexity Invariance for Time Series Classification

Yitong Li1(), Xiaotao Liu1(), Jing Liu1, Kai Wu2   

  1. 1.Guangzhou Institute of Technology, Xidian University, Guangzhou 510555, China
    2.Department of Artificial Intelligence, Xidian University, Xi'an 710071, China
  • Received:2021-05-21 Revised:2021-09-01 Online:2022-10-30 Published:2022-10-18
  • Contact: Xiaotao Liu E-mail:958933601@qq.com;xtliu@xidian.edu.cn

摘要:

为解决现有方法对较长、复杂度分布不均序列的错分类问题,提取序列复杂度的局部信息,提出了加权局部复杂度不变性距离(WLCID)包含复杂度局部表征和全局加权整合两个模型。利用滑窗分解序列,结合复杂度不变性距离表示方法提取局部复杂度信息;通过建立类表征模型,以类间距越大的子段对分类正确的贡献度越大为依据,通过归一化累积类间距来量化整合权重。与相似算法的对比实验表明:此方法不仅在复杂度分布不均的数据中表现突出,在大多数测试集也有较好的效果。在分类和聚类任务上精度的提升,说明方法在表示时间序列形态特征的复杂度信息上具有较好的能力。

关键词: 复杂度不变性距离, 局部复杂度表征, 全局复杂度加权整合, 类表征模型, 时间序列分类

Abstract:

Aiming at the misclassification of existing algorithms for long or unevenly distributed time series, the local complexity information is extracted and weighted local complexity-invariant distance (WLCID) is proposed, which includes the local complexity representation model and the weighted global complexity integration model. Sliding window is used to split up time series, and combined with the complexity-invariant distance, the local complexity information can be extracted. As to the class representation model, the integration weights are quantified with the normalized cumulative between-class distance, with the perspective that the subsequence contributes more greatly with larger between-class distance. Compared with other similar algorithms, the proposed method is good at dealing with the data with uneven complexity distribution and can also performs better in most of the test datasets processing. Besides classification tasks, the improvement in the accuracy of clustering tasks also shows its ability to represent the complexity information of the morphological characteristics of time series.

Key words: complexity-invariant distance, local complexity representation, weighted global complexity integration, class representation model, time series classification

中图分类号: