Journal of System Simulation ›› 2019, Vol. 31 ›› Issue (12): 2721-2730. doi: 10.16182/j.issn1004731x.joss.19-FZ0289

• Simulation Systems and Technology •

Noise Clipping Algorithm Based on Relative Contribution Rate

Liu Shuoyu, Dai Yueming

  1. School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
  • Received: 2019-04-16 Revised: 2019-07-07 Published: 2019-12-13
  • About the authors: Liu Shuoyu (1993-), male, from Wuxi, Jiangsu, master's student, research interest: natural language processing; Dai Yueming (1968-), male, from Changshu, Jiangsu, master's degree, associate professor, master's supervisor, research interests: natural language processing and artificial intelligence.
  • Funding:
    National Natural Science Foundation of China (61973138)

Abstract: This paper proposes a class noise cutting (CNC) algorithm based on relative contribution rate. The algorithm computes the relative contribution rate of each feature to a topic, uses a feature discrimination score to select the feature set most valuable for classifying the current topic, and then selects the corresponding candidate categories. Shrinking the candidate category set improves classification accuracy and speeds up the classifier's response. Compared with another noise cutting algorithm, eliminating class noise (ECN), CNC achieves higher accuracy, and its more compact feature dictionary and better candidate category set make its response much faster.

Key words: relative contribution rate, class noise cutting, hierarchical classification, feature selection
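
The abstract names three steps (relative contribution rate, feature discrimination score, candidate category selection) but does not give their formulas. The following is a minimal Python sketch of that pipeline under stated assumptions, not the authors' implementation: the relative contribution rate is assumed to be the share of a feature's occurrences that fall in each class, the discrimination score is assumed to be the gap between a feature's two largest shares, and the names `top_k` and `min_share` are hypothetical parameters introduced only for illustration.

```python
from collections import Counter, defaultdict


def relative_contribution(docs):
    """Return {feature: {label: share}} for docs given as (tokens, label) pairs.

    ASSUMPTION: the share of a feature's occurrences per class stands in for
    the paper's relative contribution rate, which the abstract does not define.
    """
    counts = defaultdict(Counter)
    for tokens, label in docs:
        for tok in tokens:
            counts[tok][label] += 1
    rates = {}
    for tok, per_class in counts.items():
        total = sum(per_class.values())
        rates[tok] = {label: n / total for label, n in per_class.items()}
    return rates


def discrimination_score(shares):
    """ASSUMPTION: score = gap between the largest and second-largest share."""
    ranked = sorted(shares.values(), reverse=True)
    second = ranked[1] if len(ranked) > 1 else 0.0
    return ranked[0] - second


def build_candidate_index(docs, top_k, min_share):
    """Keep the top_k most discriminative features and map each one to the
    classes it supports with a share of at least min_share."""
    rates = relative_contribution(docs)
    # Sort by descending score; break ties alphabetically for determinism.
    kept = sorted(rates, key=lambda t: (-discrimination_score(rates[t]), t))[:top_k]
    return {t: {c for c, s in rates[t].items() if s >= min_share} for t in kept}


def candidate_classes(tokens, index):
    """Union of the candidate classes of every retained feature in tokens."""
    cands = set()
    for tok in tokens:
        cands |= index.get(tok, set())
    return cands


# Toy corpus, invented purely for illustration.
toy_docs = [
    (["goal", "match", "team"], "sports"),
    (["team", "coach", "goal"], "sports"),
    (["stock", "market", "team"], "finance"),
    (["stock", "bank"], "finance"),
    (["vote", "market"], "politics"),
]
toy_index = build_candidate_index(toy_docs, top_k=7, min_share=0.5)
print(candidate_classes(["team", "stock", "market"], toy_index))
```

In this sketch a downstream classifier would score a document only against the classes returned by `candidate_classes`, which is one way to read the abstract's claim that a smaller candidate category set and a leaner feature dictionary reduce response time.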

CLC number: