Advanced Search
Zhang N, Zhou YC, Xu KC, et al. Research on an improved YOLOv8 object detection algorithm based on MobileNetV4 J. Journal of Integration Technology, 2026, 15(2): 91-103. DOI: 10.12146/j.issn.2095-3135.20250320001
Citation: Zhang N, Zhou YC, Xu KC, et al. Research on an improved YOLOv8 object detection algorithm based on MobileNetV4 J. Journal of Integration Technology, 2026, 15(2): 91-103. DOI: 10.12146/j.issn.2095-3135.20250320001

Research on an Improved YOLOv8 Object Detection Algorithm Based on MobileNetV4

  • The lightweight convolutional neural network designed for mobile devices features fast inference speed but is constrained by its inherent locality. Local information can only be captured within a windowed region, leading to performance degradation. Introducing the self-attention mechanism can capture global information, but it reduces detection speed. To address these issues, this paper introduces a hardware-friendly MobileNetV4 network architecture based on YOLOv8, incorporating a universally inverted bottleneck search block that integrates the inverted bottleneck, ConvNext, Feed Forward network, and a novel variant of extra depthwise convolution. Additionally, a dynamic upsampling operator is introduced to improve the upsampling operation, reducing GPU memory usage and latency in the model. Furthermore, this paper enhances the detection head of YOLOv8 by introducing a dynamic detection head, which combines spatial awareness, scale awareness, and task awareness into a unified framework. It effectively applies the attention mechanism in the object detection head, improving detection performance and efficiency. The experimental results demonstrate that compared to the next-best model, YOLOv8n, YOLOv8n_M achieved an improvement of 1.3% in mean average precision (mPA50~95). In terms of model complexity, YOLOv8n_M successfully compresses the model size by 36% (with a reduction of 1 million parameters) and reduces computational costs by 26% (The giga floating-point operations (GFLOPs) were reduced by 2.4). The proposed YOLOv8n_M effectively reduces the model’s parameter count and inference time while improving object detection accuracy in various environments to a certain extent.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return