系统仿真学报 ›› 2016, Vol. 28 ›› Issue (9): 2260-2266.

• 短文 • 上一篇    下一篇

基于ANNet网络的RGB-D图像的目标检测

蔡强, 魏立伟, 李海生, 曹健   

  1. 北京工商大学计算机与信息工程学院食品安全大数据技术北京市重点实验室,北京 100048
  • 收稿日期:2016-05-10 修回日期:2016-07-11 出版日期:2016-09-08 发布日期:2020-08-14
  • 作者简介:蔡强(1969-),男,重庆,博士,教授,硕导,研究方向为计算机图形学,计算几何,科学可视化,智能信息处理;魏立伟(1987-),女,河北,硕士生,研究方向为图像识别、机器学习。
  • 基金资助:
    北京市自然科学基金(4162019)

Object Detection in RGB-D Image Based on ANNet

Cai Qiang, Wei Liwei, Li Haisheng, Cao Jian   

  1. Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer and Information Engineering, Beijing Technology and Business University, Beijing 100048, China
  • Received:2016-05-10 Revised:2016-07-11 Online:2016-09-08 Published:2020-08-14

摘要: 由于深度图像采集设备的广泛使用,使得利用RGB-D图像进行目标检测成为计算机视觉领域研究热点。为了使得利用卷积神经网络所提取的特征更具有鲁棒性,设计了一种改进的卷积神经网络(本文称为ANNet),以提高检测准确率。为了提高卷积层中局部感受区域的模型分辨能力,针对AlexNet网络中卷积层中卷积核与下层数据块的线性特性,将部分卷积层改进为带有多层感知机的非线性卷积层。在NYUD2数据集上实验,结果表明,使用改进后的网络结构,在彩色图像上的检测结果提升了3%,在RGB-D图像上的检测结果提升了4%。

关键词: 目标检测, 卷积神经网络, AlexNet网络, RGB-D图像

Abstract: The wide spread of depth images acquisition devices makes object detection in RGB-D images a hotspot in the field of computer vision. In order to make the features extracted by CNN more robust and to improve the detection accuracy, an improved CNN called ANNet was designed. To enhance the model discriminability of local patches within the receptive field, some linear convolutional layers in the AlexNet with nonlinear convolutional layers were replaced which contained multilayer perceptron against the linear feature between convolution filter and underlying data patch. The experiment result shows that the detection accuracy is improved by 3% in the RGB images and 4% in the RGB-D images on the NYUD2 datasets using the improved network.

Key words: object detection, convolutional neural network, AlexNet, RGB-D images

中图分类号: