[1] Kavan L.Real-time Skeletal Animation[D]. Prague, Czech Republic: Czech Technical University, 2007. [2] 赵维, 谢晓方. 虚拟人技术发展现状及其在工程中的应用[J]. 系统仿真学报, 2009, 21(17): 5473-5476. (ZHAO Wei, XIE Xiao-fang.Development of Virtual Human Technology and Its Engineering Application[J]. Journal of System Simulation (S1004-731X), 2009, 21(17): 5473-5476.) [3] Burtnyk N, Wein M.Interactive Skeleton Techniques for Enhancing Motion Dynamics in Key Frame Animation[J]. Communications of the ACM (S0001-0782), 1976, 19(10): 564-569. [4] Chafi H, Sujeeth A K, Brown K J, et al. A Domain-specific Approach to Heterogeneous Parallelism[C]// Proceedings of the 16th ACM symposium on Principles and practice of parallel programming. USA: ACM, 2011: 35-46. [5] Brown K J, Sujeeth A K, Lee H J, et al. A Heterogeneous Parallel Framework for Domain-specific Languages[C]// Parallel Architectures and Compilation Techniques (PACT). USA: IEEE, 2011: 89-100. [6] Ismail L, GuerchiD. Performance Evaluation of Convolution on the Cell Broadband Engine Processor[J]. IEEE Transactions on Parallel and Distributed Systems (S1045-9219), 2011, 22(2): 337-351. [7] Lindholm E, Kilgard M J, Moreton H.A User-programmable Vertex Engine[C]// Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques. USA: ACM, 2001: 149-158. [8] 季卓尔, 张景峤. 基于可编程 GPU 的骨骼动画[J]. 计算机工程与应用, 2008, 44(22): 77-80. [9] 胡前亮, 陈炳发. 一种采用CUDA的骨骼动画阴影实时仿真方法[J]. 小型微型计算机系统, 2011, 32(1):165-168. [10] 郝爱民, 赵永涛, 吴伟和, 等. 任意姿态虚拟人网格模型骨骼提取算法[J]. 中国图象图形学报, 2011, 16(6): 1008-1014. [11] 刘登志. 人体角色的自动绑定与卡通运动[D]. 杭州:浙江大学, 2011. [12] Yan H B, Hu S M, Martin R R, et al. Shape Deformation Using a Skeleton to Drive Simplex Transformations[J]. IEEE Transactions on Visualization and Computer Graphics (S1077-2626), 2008, 14(3): 693-706. [13] Lee V W, Kim C, Chhugani J, et al. Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU[C]// ACM SIGARCH Computer Architecture News. USA: ACM, 2010: 451-460. [14] Shi G, Li M, Lipasti M.Accelerating Search and Recognition Workloads with SSE 4.2 String and Text Processing Instructions[C]// Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on. USA: IEEE, 2011: 145-153. [15] Shen J, Fang J, Sips H, et al. Performance Gaps between OpenMP and OpenCL for Multi-core CPUs[C]// Parallel Processing Workshops (ICPPW), 2012 41st International Conference on. Pittsburgh, PA, USA: IEEE, 2012: 116-125. [16] Pennycook S J, Hammond S D, Wright S A, et al. An Investigation of the Performance Portability of OpenCL[J]. Journal of Parallel and Distributed Computing (S0743-7315), 2013, 73(11): 1439-1450. [17] Khronos. The OpenCL Specification [EB/OL]. (2012) [2012-11-14]. http://www.khronos.org/r egistry/cl/specs/ opencl-1.2.pdf. [18] Du P, Weber R, Luszczek P, et al. From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming[J]. Parallel Computing (S0167-8191), 2012, 38(8): 391-407. [19] Sun X H, Chen Y.Reevaluating Amdahl’s Law in the Multicore Era[J]. Journal of Parallel and Distributed Computing (S0743-7315), 2010, 70(2): 183-188. |