Journal of System Simulation ›› 2024, Vol. 36 ›› Issue (5): 1130-1140.doi: 10.16182/j.issn1004731x.joss.22-1480


Gradient-based Deep Reinforcement Learning Interpretation Methods

Wang Yuan1,2(), Xu Lin2, Gong Xiaoze3(), Zhang Yongliang4, Wang Yongli1   

  1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
    2.Science and Technology on Information Systems Engineering Laboratory, Nanjing 210014, China
    3.PLA 63850 Troops, Baicheng 137001, China
    4.Command and Control Engineering College, Army Engineering University of PLA, Nanjing 210007, China
  • Received:2022-12-11 Revised:2023-03-21 Online:2024-05-15 Published:2024-05-21
  • Contact: Gong Xiaoze E-mail:2534764965@qq.com;305118154@qq.com

Abstract:

The learning process and working mechanism of deep reinforcement learning (DRL) methods such as DQN are opaque: their decision basis and reliability cannot be perceived, which casts doubt on the decisions the model makes and greatly limits the application scenarios of deep reinforcement learning. To explain the decision-making mechanism of intelligent agents, this paper proposes a gradient-based saliency map generation algorithm, SMGG. It uses the gradient information of the feature maps produced by high-level convolutional layers to compute the importance of each feature map. With the model's structure and internal parameters known, SMGG starts from the last layer of the model and computes the gradient of each feature map to obtain its weight relative to the saliency map. It classifies feature importance into positive and negative directions: weights with a positive influence are used to weight the features captured in the feature maps, forming a forward interpretation of the current decision, while weights with a negative influence on other categories are used to weight those features, forming a reverse interpretation of the current decision. The two together generate the saliency map of the decision, yielding the basis for the agent's decision-making behavior. Experiments demonstrate the effectiveness of the method.
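The weighting scheme described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: it assumes the feature maps and their gradients (with respect to the chosen action's score) have already been extracted from the last convolutional layer as arrays of shape (C, H, W), and it uses global-average-pooled gradients as the per-map weights, split into positive (forward interpretation) and negative (reverse interpretation) parts before combining.

```python
import numpy as np

def smgg_saliency(feature_maps, gradients):
    """Illustrative SMGG-style saliency map (sketch, not the authors' code).

    feature_maps: (C, H, W) activations of the last convolutional layer
    gradients:    (C, H, W) gradients of the selected action's score
                  with respect to those activations
    """
    # Importance weight of each feature map: global average of its gradient
    weights = gradients.mean(axis=(1, 2))            # shape (C,)

    # Classify importance into positive and negative directions
    pos = np.clip(weights, 0, None)
    neg = np.clip(weights, None, 0)

    # Forward interpretation: features that support the current decision
    forward = np.maximum((pos[:, None, None] * feature_maps).sum(axis=0), 0)
    # Reverse interpretation: features weighted by the negative influence
    reverse = np.maximum((-neg[:, None, None] * feature_maps).sum(axis=0), 0)

    # The two interpretations together form the decision's saliency map
    saliency = forward + reverse
    if saliency.max() > 0:                           # normalize for display
        saliency = saliency / saliency.max()
    return saliency
```

In a real agent, `feature_maps` would come from a forward hook on the last convolutional layer and `gradients` from backpropagating the Q-value of the chosen action; the shapes and the pooling choice here are illustrative assumptions.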

Key words: DRL, saliency map, interpretability, agent, gradient
