[1] 翟东海, 余江, 高飞, 等. 最大距离法选取初始簇中心的K-means文本聚类算法的研究[J]. 计算机应用研究, 2014, 31(3): 713-715. Zhai Donghai, Yu Jiang, Gao Fei.K-means text clustering algorithm based on initial cluster centers selection according to maximum distance[J]. Application Research of Computers, 2014, 31(3): 713-715. [2] Sholom M Weiss, Nitin Indurkhya, Tong Zhang.Fundamentals of predictive text mining [M]. Xi’an, China: Xi’an Jiaotong University Press, 2012: 97-103. [3] 彭京, 杨冬青, 唐世渭, 等. 一种基于语义内积空间模型的文本聚类算法[J]. 计算机学报, 2007, 30(8): 1354-1363. Peng Jing, Yang Dongqing, Tang Shiwei.A Novel Text Clustering Algorithm Based on Inner Product Space Model of Semantic[J]. Chnese Journal of Computers, 2007, 30(8): 1354-1363. [4] 邓海, 覃华, 孙欣. 一种优化初始中心的K-means聚类算法[J]. 计算机技术与发展, 2013, 23(11): 42-45. Deng Hai, Tan Hua, Sun Xin.A K-means Clustering Algorithm of Meliorated Initial Center[J]. Computer Technology and Development, 2013, 23(11): 42-45. [5] Wong K C.A short survey on data clustering algorithms[C]// International Conference on Soft Computing and Machine Intelligence. USA: IEEE, 2015: 64-68. [6] 熊忠阳, 陈若田, 张玉芳. 一种有效的K-means聚类中心初始化方法[J]. 计算机应用研究, 2011, 28(11): 4188-4190. Xiong Zhongyang, Chen Ruotian, Zhang Yufang.Effective method for cluster centers’s initialization in K-means clustering[J]. Application Research of Computers, 2011, 28(11): 4188-4190. [7] 龚静, 李安民. 一种改进的k-means中文文本聚类算法[J]. 湖南工业大学学报, 2008, 22(2): 52-54. Gong Jing, Li Anmin.Clustering Algorithm of One Improved K-means Chinese Text[J]. Journal of Hunan University of Technology, 2008, 22(2): 52-54. [8] Shehroz S Khan, Amir Ahmad.Cluster center initialization algorithm for K-Means clustering[J]. Pattern Recognition Letters (S0167-8655), 2004, 25(11): 1293-1302. [9] 牛棍, 张舒博, 陈俊亮. 融合网格密度的聚类中心初始化方案[J]. 北京邮电大学学报, 2005, 30(2): 7-10. Niu Kun, Zhang Shubo, Chen Junliang.A Cell Density Enabled Schema for Initializing Cluster Centers[J]. Journal of Beijing University of Posts and Telecommunications, 2005, 30(2): 7-10. [10] 张健沛, 杨悦, 杨静, 等. 基于最优划分的K-Means初始聚类中心选取算法[J]. 系统仿真学报, 2009, 21(9): 2586-2590. Zhang Jianpei, Yang Yue, Yang Jing.Algorithm for Initialization of K-means Clustering Center Based on Optimized-Division[J]. Journal of System Simulation (S1004-731X), 2009, 21(9): 2586-2590. [11] 何亮亮. SVD在文本分类中的应用 [D]. 广州: 华南理工大学, 2012. He Liangliang.Application of the SVD in text classification [D]. Guangzhou, China: South China University of Technology, 2012. [12] 吴夙慧, 成颖, 郑彦宁, 等. 文本聚类中文本表示和相似度计算研究综述[J]. 情报科学, 2012, 22(4): 22-25. Wu Suhui, Cheng Ying, Zhen Yanyu.A Review of Text Representation and Similarity Calculation in Text Clustering[J]. Information Science, 2012, 22(4): 22-25. [13] 黄承慧, 印鉴, 侯昉. 一种结合词项语义信息和TF-IDF方法的文本相似度量方法[J]. 计算机学报, 2011, 34(5): 856-864. Huang Chenghui, Yin Jian, Hou Fang.A Text Similarity Measurement Combining Word Semantic Information with TF-IDF Method[J]. Chinese Journal of Computers, 2011, 34(5): 856-864. [14] 林少波, 杨丹, 徐玲. 基于类别相关的新文本特征提取方法[J]. 计算机应用研究, 2012, 29(5): 1680-1683. Lin Shaobo, Yang Dan, Xu Ling.New Approach to Feature Selection for Text Categorization Using Class Correlation[J]. Application Research of Computers, 2012, 29(5): 1680-1683. [15] 周昭涛. 文本聚类分析效果评价及文本表示研究 [D]. 北京: 中国科学院研究生院(计算技术研究所), 2005. Zhou Zhaotao.Quality Evaluation of Text Clustering Results and Investigation on Text Representation [D]. Beijing, China: Graduate University of Chinese Academy of Science (Computer Software and Theory), 2005. [16] K Van Rijsbergen.Information retrieval [M]. London, UK: Butterworths Press, 1979: 267-301. [17] Gu M, Demmel J, Dhillon I.LAPACK Working Note 88: Efficient Computation of the Singular Value Decomposition with Applications to Least Squares Problems[M]. USA: University of Tennessee, 1994: 68-70. [18] 廖安平, 刘建州. 矩阵论[M]. 湖南: 湖南大学出版社, 2005: 57-58. Liao Anping,Liu Jianzhou.Matrix Theory [M]. Hunan, China: Hunan University Press, 2005: 57-58. [19] 吴军. 数学之美[M]. 北京: 人民邮电出版社, 2014: 136-141. Wu Jun.The beauty of Mathematics [M]. Beijing, China: People Post Press, 2014: 136-141. [20] 王怡, 盖杰, 武港山, 等. 基于潜在语义分析的中文文本层次分类技术[J]. 计算机应用研究, 2004, 21(8): 151-154. Wang Yi, Gai Jie, Wu Gangshan.Technology of Chinese Documents Multi-hierarchy Categorization Based on Latent Semantic Analysis[J]. Application Research of Computers, 2004, 21(8): 151-154. [21] Golub G, Kahan W.Calculating the singular values and pseudo-inverse of matrix[J]. Siam Journal on Numerical Analysis (S1095-7170), 1965, 2(2): 205-224. [22] 蔡宇浩, 梁永全, 樊建聪, 等. 加权局部方差优化初始簇中心的K-means算法[J]. 计算机科学与探索, 2016, 10(5): 732-741. Cai Yuhao, Liang Yongquan, Fan Jiancong.Optimizing Initial Cluster Centroids by Weighted Local Variance in K-means Algorithm[J]. Journal of Frontiers of Computer Science and Technology, 2016, 10(5): 732-741. |