系统仿真学报 ›› 2021, Vol. 33 ›› Issue (1): 24-36.doi: 10.16182/j.issn1004731x.joss.20-0690
庄穆妮1,3, 李勇1,2, 谭旭1,3, 毛太田1, 蓝凯城3, 邢立宁4
收稿日期:
2020-08-31
修回日期:
2020-11-04
发布日期:
2021-01-18
作者简介:
庄穆妮(1996-),女,硕士生,研究方向为网络舆情分析。E-mail:997737694@qq.com
基金资助:
Zhuang Muni1,3, Li Yong1,2, Tan Xu1,3, Mao Taitian1, Lan Kaicheng3, Xing Lining4
Received:
2020-08-31
Revised:
2020-11-04
Published:
2021-01-18
摘要: 构建大规模网络舆情演化仿真模型,对新冠疫情武汉重灾区与全国其他地区采取差异化的应急管理和舆情疏导具有指导价值。为实现主题细粒度的舆情情感演化仿真,将LDA(Latent Dirichlet Allocation)主题模型与BERT(Bidirectional Encoder Representations from Transformers)词向量深度融合,优化主题向量助力文本主题聚类;同时,在改进BERT预训练任务的基础上,叠加深度预训练任务,以提高模型在情感分类中的精确度。结果表明:在主题向量训练过程中,改进的BERT-LDA模型较原始LDA模型NPMI(Normalized Pointwise Mutual Information)值提升0.357;在疫情事件情感分类任务上,AUC(Area Under the Curve)值超过了99.6%,证明其能够有效运用于大规模网络舆情演化仿真。
中图分类号:
庄穆妮, 李勇, 谭旭, 毛太田, 蓝凯城, 邢立宁. 基于BERT-LDA模型的新冠肺炎疫情网络舆情演化仿真[J]. 系统仿真学报, 2021, 33(1): 24-36.
Zhuang Muni, Li Yong, Tan Xu, Mao Taitian, Lan Kaicheng, Xing Lining. Evolutionary Simulation of Online Public Opinion Based on the BERT-LDA Model under COVID-19[J]. Journal of System Simulation, 2021, 33(1): 24-36.
[1] 吴世文. 重大突发公共卫生事件中的伪信息传播、治理困境及其突破路径——以新冠肺炎疫情为例[J].电子政务, 2020(9): 40-50. Wu Shiwen.Novel Coronavirus Pneumonia, the Spread of False Information, and the Way to Solve the Problem: Taking the New Crown Pneumonia Epidemic as an Example[J]. E-Government, 2020(9): 40-50. [2] 齐佳音, 方滨兴. 重大突发事件中网络舆情引导及治理研究—以新型冠状病毒肺炎疫情为例[J]. 上海对外经贸大学学报, 2020, 27(3): 5-13. Qi Jiayin, Fang Binxing.Network Public Opinion Response and Governance Innovation in Serious Emergencies: Take the COVID-19 Epidemic as an Example[J]. Journal of Shanghai University of International Business and Economics, 2020, 27(3): 5-13. [3] 曹武军, 陈秦秀, 薛朝改. 重大疫情网络舆情防控策略研究[J]. 情报杂志, 2020, 39(10): 107-114. Cao Wujun, Chen Qinxiu, Xue Chaogai.Research on Online Public Opinion Prevention and Control Strategies for Major Epidemic Diseases[J]. Journal of Intelligence, 2020, 39(10): 107-114. [4] 王文, 王树锋, 李洪华. 基于文本语义和表情倾向的微博情感分析方法[J]. 南京理工大学学报, 2014, 38(6): 733-738,749. Wang Wen, Wang Shufeng, Li Honghua.Microblogging Sentiment Analysis Method Based on Text Semantics and Expression Tendentiousness[J]. Journal of Nanjing University of Science and Technology, 2014, 38(6): 733-738,749. [5] Bose R, Dey R K, Roy S, et al.Analyzing Political Sentiment Using Twitter Data[C]. Information and Communication Technology for Intelligent Systems. Singapore: Springer, 2019: 427-436. [6] Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment Classification Using Machine Learning Techniques [J/OL]. EMNLP, 2002, 10: 79-86[2020-05-21]. https://arxiv.org/abs/ cs/0205070. [7] Zhang H. The Optimality of Naive Bayes [J/OL]. American Association for Artificial Intelligence, 2004. [2020-05-21]. https://www.aaai.org. [8] Purnamasari N M G D, Fauzi M A, Indriati L S D. Cyberbullying Identification in Twitter Using Support Vector Machine and Information Gain Based Feature Selection[J]. Indonesian Journal of Electrical Engineering and Computer Science (S2502-4752), 2020, 18(3): 1494-1500. [9] Sherstinsky A.Fundamentals of Recurrent Neural Network (RNN) and Long Short-term Memory (LSTM) Network[J]. Physica D: Nonlinear Phenomena (S0167-2789), 2020, 404: 132306. [10] Vaswani A, Shazeer N, Parmar N, et al.Attention is All You Need[C]. Advances in Neural Information Processing Systems, 2017: 5998-6008. [11] Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[C/OL]. NAACL-HLT (1). 2019 [2020-05-21]. https://arxiv.org/abs/1810.04805. [12] 杨晨, 宋晓宁, 宋威. SentiBERT: 结合情感信息的预训练语言模型[J]. 计算机科学与探索, 2020, 14(9): 1563-1570. Yang Chen, Song Xiaoning, Song Wei.SentiBERT: A Pretraining Language Model Combining Sentiment Information[J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(9): 1563-1570. [13] Sun C, Qiu X, Xu Y, et al. How to Fine-Tune BERT for Text Classification?[J/OL]. Computation and Language, 2019, 11856: 194-206[2020-05-21]. https://doi.org/10. 1007/978-3-030-32381-3_16. [14] Blei D M, Ng A Y, Jordan M I.Latent Dirichlet Allocation[J]. Journal of Machine Learning Research (S1532-4435), 2003, 3: 993-1022. [15] He K, Zhang X, Ren S, et al.Deep Residual Learning for Image Recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE Xplore, 2016: 770-778. [16] Kingma D P, Ba J. Adam: A Method for Stochastic Optimization[J]. Learning, 2014 [2020-05-21]. https://arxiv. org/abs/1412.6980. [17] Wang G, Wong K W, Lu J. AUC-Based Extreme Learning Machines for Supervised and Semi-Supervised Imbalanced Classification[J]. IEEE Transactions on Systems, Man,Cybernetics: Systems (S2168-2216), 2020: 1-12[2020-05-21]. https://ieeexplore.ieee.org/ abstract/document/9063675. [18] Lalmas M C J V. Information Retrieval: Uncertainty and Logics: Advanced Models for the Representation and Retrieval of Information[M]. Boston: Kluwer Academic Publishers, 1998. [19] Zhang Z, Sabuncu M. Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels[C/OL]. Advances in Neural Information Processing Systems.2018: 8778-8788 [2020-05-21]. https://papers.nips.cc/paper/2018/hash/f2925f97bc13ad2852a7a551802feea0-Abstract.html. [20] Kumar N, Deepak G, Santhanavijayan A.A Novel Semantic Approach for Intelligent Response Generation using Emotion Detection Incorporating NPMI Measure[J]. Procedia Computer Science (S1877-0509), 2020, 167: 571-579. [21] 郭业才, 张浩然. 基于改进LDA和自编码器的调制识别算法[J/OL].系统仿真学报: 1-6 [2020-05-21]. http://kns.cnki.net/kcms/detail/11.3092.V.20200102.1527. 008.html. Guo Yecai, Zhang Haoran. Modulation Recognition Algorithm Based on Improved LDA and Autoencoders [J/OL]. Journal of System Simulation: 1-6 [2020-05-21]. http://kns.cnki.net/kcms/detail/11.3092.V.20200102.1527. 008.html. [22] Chen X, Xu L, Liu Z, et al.Joint Learning of Character and Word Embeddings[C]. International Conference on Artificial Intelligence. Argentina: AAAI Press, 2015: 1236-1242. [23] 郭景萍. 社会公共安全视野下的情感安全调控[J]. 湖南师范大学社会科学学报, 2009, 38(2): 87-90. Guo Jingping.Emotional Security Control on the Perspective of Social Public Safety[J]. Journal of Social Science of Hunan Normal University, 2009, 38(2): 87-90. |
[1] | 李智杰, 石昊琦, 李昌华, 张颉. 基于改进遗传算法的影像中心布局优化方法[J]. 系统仿真学报, 2022, 34(6): 1173-1184. |
[2] | 陈斌, 刘悦, 杨亚磊. 基于STN的机场航班过站保障时间协同规划建模[J]. 系统仿真学报, 2022, 34(6): 1196-1207. |
[3] | 杨凯, 陈纯毅, 胡小娟, 于海洋. 蒙卡渲染画面多特征非局部均值滤波降噪算法[J]. 系统仿真学报, 2022, 34(6): 1259-1266. |
[4] | 陈麒, 崔昊杨. 基于改进鸽群层级的无人机集群视觉巡检模型[J]. 系统仿真学报, 2022, 34(6): 1275-1285. |
[5] | 王沐晴, 张磊, 范秀敏, 骆晓萌, 朱文敏. VR外设驱动的虚拟人姿态优化仿真方法[J]. 系统仿真学报, 2022, 34(6): 1296-1303. |
[6] | 陆承, 靳学胜. 基于Steam VR的交互仿真水枪灭火训练系统设计[J]. 系统仿真学报, 2022, 34(6): 1312-1319. |
[7] | 高宏鼐, 付丽疆, 夏倩, 郭亚. 可观测度在光合作用模型性能评估中的应用[J]. 系统仿真学报, 2022, 34(6): 1330-1342. |
[8] | 倪凌佳, 黄晓霞, 李红旮, 张子博. 基于协作式深度强化学习的火灾应急疏散仿真研究[J]. 系统仿真学报, 2022, 34(6): 1353-1366. |
[9] | 蒙盾, 胡卓, 张华军. 基于改进A*算法的多层邮轮疏散系统仿真[J]. 系统仿真学报, 2022, 34(6): 1375-1382. |
[10] | 郭宇飞, 赵康, 海永清. 面向有限元分析的三角网格布尔运算方法[J]. 系统仿真学报, 2022, 34(5): 1003-1014. |
[11] | 吴桐, 王清辉, 徐志佳. 三周期极小曲面多孔材料渗透率尺度特性研究[J]. 系统仿真学报, 2022, 34(5): 1015-1024. |
[12] | 蒋阳升, 王思琛, 高宽, 刘梦, 姚志洪. 混入智能网联车队的混合交通流元胞自动机模型[J]. 系统仿真学报, 2022, 34(5): 1025-1032. |
[13] | 梁江涛, 王慧琴. 基于改进蚁群算法的建筑火灾疏散路径规划研究[J]. 系统仿真学报, 2022, 34(5): 1044-1053. |
[14] | 张其文, 张斌. 基于教学优化算法求解置换流水车间调度问题[J]. 系统仿真学报, 2022, 34(5): 1054-1063. |
[15] | 邢根上, 鲁芳, 李书山, 罗定提. 基于产品体验性的供应链交货模型与仿真研究[J]. 系统仿真学报, 2022, 34(5): 1064-1075. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||