本文已被:浏览 102次 下载 54次
投稿时间:2025-02-17 修订日期:2025-03-24
投稿时间:2025-02-17 修订日期:2025-03-24
中文摘要: 深度学习在岩石图像岩性识别中,常面临背景干扰和相似岩性类别难以区分的挑战。为此,本文提出了一种基于改进YOLOv11网络的两阶段并行岩石图像分类方法。第一阶段利用YOLOv11分割模型去除岩石图像中的干扰背景,提取出目标岩石对象;第二阶段滑动裁剪分割得到的岩石对象,并行输入改进的YOLOv11n分类模型进行分类,并对预测结果进行阈值处理,以优化相似岩性类别的分类精度。为提升模型对岩石图像细节特征的提取能力,本文在YOLOv11n分类模型中引入超分辨率SAFMN重建模块,并对该模块进行改动以使其适配YOLO网络架构。同时,优化了YOLO网络的损失函数,采用GHM-Loss策略,以平衡不同类型样本对模型训练的贡献,提高模型训练效率与可靠性。本文构建了岩石图像分割和分类数据集,并在这些数据集上进行了相关实验,YOLOv11分割模型的对比实验结果表明,YOLOv11分割模型在岩石图像分割任务中的表现优于其他模型;改进的YOLOv11n分类模型在消融实验中相比基线模型在精度、召回率和准确率上分别提高了12%、1.7%和0.4%。在最终的方法对比实验中,提出的两阶段并行岩石图像分类方法的准确率平均提高了22.5%,验证了该方法在岩石图像分类任务中的优越性和有效性。
Abstract:In rock image lithology identification, deep learning frequently encounters challenges such as background interference and the difficulty in distinguishing similar lithology categories. To address these issues, this paper introduces a two-stage parallel rock image classification method based on an enhanced YOLOv11 network. In the first stage, the YOLOv11 segmentation model is employed to eliminate background noise from rock images and isolate the target rock objects. In the second stage, rock objects obtained through sliding window clipping are classified in parallel using the improved YOLOv11n classification model, with threshold processing applied to the prediction results to enhance the accuracy of classifying similar lithology categories. To improve the model"s ability to capture detailed features of rock images, this study incorporates a super-resolution SAFMN reconstruction module into the YOLOv11n classification model, adapting it to fit the YOLO network architecture. Additionally, the loss function of the YOLO network has been optimized by adopting the GHM-Loss strategy, which balances the contribution of different sample types during model training, thereby enhancing both the efficiency and reliability of the training process. This paper constructs datasets for rock image segmentation and classification and conducts relevant experiments on these datasets. The comparative experimental results demonstrate that the YOLOv11 segmentation model outperforms other models in rock image segmentation tasks. Compared to the baseline model, the enhanced YOLOv11n classification model achieves improvements of 12% in precision (P), 1.7% in recall rate (R), and 0.4% in accuracy rate (A). In the final method comparison experiment, the proposed two-stage parallel rock image classification method demonstrates an average improvement of 22.5% in accuracy, validating its superiority and effectiveness in rock image classification tasks.
keywords: Deep learning Lithology identification Image classification Image segmentation YOLO model improvements
文章编号: 中图分类号: 文献标志码:
基金项目:国家自然科学基金青年科学基金
| Author Name | Affiliation | |
| Li Tianwen | School of Earth Sciences,Yangtze University | 2023710529@yangtzeu.edu.cn |
| Li Gongquan | School of Earth Sciences,Yangtze University | 3258174500@qq.com |
引用文本:
李天文,李功权.基于优化 YOLO11n 网络的两阶段并行图像 岩性识别方法[J].有色金属(选矿部分),2025(10):46-62.
Li Tianwen,Li Gongquan.Two-stage parallel image based on optimized YOLO11 networkLithology identification method[J].Nonferrous Metals(Mineral Processing Section),2025(10):46-62.
李天文,李功权.基于优化 YOLO11n 网络的两阶段并行图像 岩性识别方法[J].有色金属(选矿部分),2025(10):46-62.
Li Tianwen,Li Gongquan.Two-stage parallel image based on optimized YOLO11 networkLithology identification method[J].Nonferrous Metals(Mineral Processing Section),2025(10):46-62.

关注微信公众号