本文已被:浏览 93次 下载 54次
投稿时间:2025-03-13 修订日期:2025-04-01
投稿时间:2025-03-13 修订日期:2025-04-01
中文摘要: 针对传统煤矸石分选方法存在的时效性及可靠性差等问题,提出了一种基于ST-YOLOX-S的多尺度煤矸石识别方法,利用机器视觉和图像处理技术,通过深度学习模型提取可见光图像特征,实现煤矸石的高效识别。在YOLOX-S的基础上,引入Swin-Transformer建立ST-YOLOX-S模型,以增强全局特征提取能力,并通过多尺度混合扩张卷积技术捕获不同尺度的特征信息,有效解决了煤矸石目标全局特征提取不足、尺度多样性和形态复杂性问题,显著提升了模型的检测性能。在基于选煤厂实际生产环境条件下的自建煤矸石数据集进行实验,结果表明,在95%置信度水平下,ST-YOLOX-S模型的预测准确率达到90.11%,相较于原始YOLOX-S模型提高了7.30%。对比改进后的ST-YOLOX-S算法与其他主流目标检测算法,ST-YOLOX-S算法精确度为0.89%,参数量为7.80 MB,召回率为0.92%,显著优于YOLOV4、YOLOV5、RCNN和EfficientDet及CenterNet算法。消融实验进一步证实了ST-YOLOX-S模型中各个组件的有效性,在添加了多尺度混合扩张卷积和替换Swin-Transformer主干网络之后,YOLOX的精确度、召回率与FPS值分别提高了5.95%、10.84%和25.48%。最后,使用ST-YOLOX-S进行测试,改进后模型在检测目标时出现重框的现象更小,检测的概率值更高,表明其在煤矸石检测中的优越性能和实际应用价值,这对提高煤炭清洁高效利用具有重要意义。
中文关键词: 煤矸石分选 深度学习 YOLOX 机器视觉 Swin-Transformer模型
Abstract:In response to the issues of poor timeliness and reliability in traditional coal gangue separation methods, a multi-scale coal gangue recognition method based on ST-YOLOX-S is proposed. This method utilizes machine vision and image processing technology to efficiently identify coal gangue by extracting visible light image features through deep learning models. Building upon YOLOX-S, the ST-YOLOX-S model is established by introducing Swin-Transformer to enhance global feature extraction capability. Additionally, the use of multi-scale mixed expandable convolutional technology captures feature information of different scales, effectively addressing the insufficient global feature extraction, scale diversity, and complex morphology problems associated with coal gangue targets. As a result, the detection performance of the model is significantly improved. Experiments conducted on a self-built dataset of coal gangue under actual production conditions at a coal preparation plant demonstrate that the prediction accuracy of the ST-YOLOX-S model reaches 90.11% at a confidence level of 95%, representing an improvement of 7.30% compared to the original YOLOX-S model. When compared with other mainstream object detection algorithms such as YOLOV4, YOLOV5, RCNN, EfficientDet and CenterNet algorithms, the precision rate (0.89%), parameter count (7.80 MB), and recall rate (0.92%) of the ST-YOLOX-S algorithm are significantly superior. Further ablation experiments confirm the effectiveness of each component in the ST-YOLOX-S model; after adding multi-scale mixed expandable convolutional layers and replacing Swin-Transformer as backbone network components into YOLOX-S, the precision rate, recall rate, and FPS values have increased by 5.95%, 10.84%, 25.48%, respectively. Finally, testing using ST-YOLOX-S shows that after improving this model, there is less occurrence of overlapping boxes when detecting targets, and higher probability values detected which indicates its superior performance in coal gangue detection. This has important significance for promoting clean and efficient utilization of coal.
文章编号: 中图分类号: 文献标志码:
基金项目:江苏省杰出青年资助(编号:BK20240105)
引用文本:
于大伟,邵明,崔萌,姜坤坤,郭东东,王得全,陈彪,石宇含,张亚东.基于ST-YOLOX-S的多尺度煤矸石识别研究[J].有色金属(选矿部分),2025(10):63-72.
YU Dawei,SHAO Ming,CUI Meng,JIANG Kunkun,GUO Dongdong,WANG Dequan,CHEN Biao,SHI Yuhan,ZHANG Yadong.Research on Multi scale Coal Gangue Identification Based on ST-YOLOX-S[J].Nonferrous Metals(Mineral Processing Section),2025(10):63-72.
于大伟,邵明,崔萌,姜坤坤,郭东东,王得全,陈彪,石宇含,张亚东.基于ST-YOLOX-S的多尺度煤矸石识别研究[J].有色金属(选矿部分),2025(10):63-72.
YU Dawei,SHAO Ming,CUI Meng,JIANG Kunkun,GUO Dongdong,WANG Dequan,CHEN Biao,SHI Yuhan,ZHANG Yadong.Research on Multi scale Coal Gangue Identification Based on ST-YOLOX-S[J].Nonferrous Metals(Mineral Processing Section),2025(10):63-72.

关注微信公众号