基于深度强化学习的对手建模方法研究综述 |
徐浩添, 秦龙, 曾俊杰, 胡越, 张琪 |
Research Progress of Opponent Modeling Based on Deep Reinforcement Learning |
Haotian Xu, Long Qin, Junjie Zeng, Yue Hu, Qi Zhang |
图4 递归推理过程概率图模型 |
Fig. 4 Probabilistic graph model of recursive inference process |
![]() |