Citation: Zhang Fan, Luan Jiaxing, Cui Donglin, et al. SSD-LeNet based method of mine moving target detection and recognition[J]. Journal of Mining Science and Technology, 2021, 6(1): 100-108. DOI: 10.19606/j.cnki.jmst.2021.01.011

SSD-LeNet based method of mine moving target detection and recognition

  • Abstract: To address the low detection accuracy and poor real-time performance of moving-target detection and recognition in underground mines with fog, dust, and low illumination, a detection and recognition method based on SSD-LeNet is proposed. A visual sensor captures a frame of the underground mine scene to construct the environment model. The original image of the moving target serves as the model input, and a digital identifier is embedded at a specific position in the moving-target image, producing a data set that contains the position information of the digit sequence. A single shot multibox detector (SSD) model trained offline outputs the target feature categories corresponding to their positions. The trained SSD model then detects the position of the digit sequence on moving-target images in the test set, and the rectangular region at that position is segmented into characters. The segmented characters are fed one by one into the LeNet-5 network for recognition, and the recognized characters are combined in order into a digit sequence, from which the identity information of the moving target is quickly retrieved. The results show that, compared with other deep-learning methods for target detection and recognition, the proposed method achieves higher accuracy and stronger robustness in low-illumination and noisy mine environments and can meet real-time requirements.
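
The later stages described in the abstract (character segmentation of the detected rectangle, per-character recognition, and identity retrieval from the assembled digit sequence) can be sketched as follows. This is a minimal illustration and not the authors' implementation. The abstract does not say how characters are segmented, so the equal-width split is an assumption. The names `split_box`, `crop`, `recognize_sequence`, and `lookup_identity`, the box convention, and the identity registry are all hypothetical. The `classify_digit` callable stands in for a trained LeNet-5 classifier, and the input box stands in for the SSD detector's output.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Hypothetical conventions, not taken from the paper:
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max) in pixels
Image = Sequence[Sequence[int]]   # grayscale image as rows of pixel values


def split_box(box: Box, n_chars: int) -> List[Box]:
    """Split a detected digit-sequence box into n_chars character boxes.

    Equal-width splitting is an assumption made for illustration only.
    """
    x_min, y_min, x_max, y_max = box
    width = x_max - x_min
    # Integer edges so that the character boxes tile the region exactly.
    edges = [x_min + (width * i) // n_chars for i in range(n_chars + 1)]
    return [(edges[i], y_min, edges[i + 1], y_max) for i in range(n_chars)]


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the pixels inside box out of the image."""
    x_min, y_min, x_max, y_max = box
    return [list(row[x_min:x_max]) for row in image[y_min:y_max]]


def recognize_sequence(
    image: Image,
    box: Box,
    n_chars: int,
    classify_digit: Callable[[List[List[int]]], int],
) -> str:
    """Classify each character crop in left-to-right order and join the digits.

    classify_digit is a placeholder for a trained LeNet-5 style classifier.
    """
    digits = [classify_digit(crop(image, b)) for b in split_box(box, n_chars)]
    return "".join(str(d) for d in digits)


def lookup_identity(sequence: str, registry: Dict[str, str]) -> Optional[str]:
    """Retrieve the identity for a digit sequence, or None if it is unknown."""
    return registry.get(sequence)
```

With a real detector and classifier, `recognize_sequence` would receive the rectangle predicted for the digit sequence, and `registry` would map each digit sequence to the identity record of a moving target.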

     
