基于时空增强生成模型的协同音频人体全身动作生成
张硕喆, 宋文凤, 侯霞, 李帅
Full-body Co-speech Gesture Generation Based on Spatial-temporal Enhanced Generation Model
Zhang Shuozhe, Song Wenfeng, Hou Xia, Li Shuai
系统仿真学报
.
2026, (1): 211
-224
.
DOI: 10.16182/j.issn1004731x.joss.25-0833