A YOLO-NL object detector for real-time detection (2024)

research-article

Author: Yan Zhou

Published: 27 February 2024 Publication History

  • 0citation
  • 0
  • Downloads

Metrics

Total Citations0Total Downloads0

Last 12 Months0

Last 6 weeks0

  • Get Citation Alerts

    New Citation Alert added!

    This alert has been successfully added and will be sent to:

    You will be notified whenever a record that you have chosen has been cited.

    To manage your alert preferences, click on the button below.

    Manage my Alerts

    New Citation Alert!

    Please log in to your account

      • View Options
      • References
      • Media
      • Tables
      • Share

    Abstract

    In recent years, YOLO object detection models have undergone significant advancement due to the success of novel deep convolutional networks. The success of these YOLO models is often attributed to their use of guidance techniques, such as expertly tailored deeper backbone and meticulously crafted detector head, which provides effective mechanisms to tradeoff between accuracy and efficiency. However, these sluggish-reasoning models are not capable of handling false detection and negative phenomena, facing challenges include improving the robustness of scaled objects detection against occlude and densely sophisticated scenarios.

    To address these limitations, we propose a novel object detector, You Only Look Once and None Left (YOLO-NL). Our model includes a novel global dynamic label assignment strategy, which allocates labels for specific anchors to maintain a balance between higher precision detection and finer localization. To enhance the detection capability of multi-scale objects in complex scenes, we separately upgrade CSPNet and PANet using the shortest-longest gradient strategy and self-attention mechanism. To meet the need for fast inference, we propose the Rep-CSPNet network using the reparameterization method to convert residual convolutions to ghost linear operations. Additionally, we accelerate the feature extraction process by deploying the serial SSPP structure. The proposed model is robust to scale objects against negative effectives such as dust, dense, ambiguous, and obstructed scenes. YOLO-NL achieved a mAP of 52.9% on the COCO 2017 test dataset, exhibiting a significant improvement of 2.64% compared to the baseline YOLOX. It is worth noting that YOLO-NL can perform high-accuracy and high-speed face mask detection in real-life scenarios. The YOLO-NL model was employed on self-built FMD and large open-source datasets, and the results show that it outperforms the other state-of-the-art methods, achieving 98.8% accuracy while maintaining 130 FPS.

    References

    [1]

    A.H. Ashraf, M. Imran, A.M. Qahtani, A. Alsufyani, O. Almutiry, A. Mahmood, …., M. Habib, Weapons detection for security and video surveillance using cnn and YOLO-v5s, CMC-Comput. Mater. Contin 70 (2022) 2761–2775.

    [2]

    Bochkovskiy, A., Wang, C.-Y., & Liao, H.-Y. M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934.

    [3]

    A. Cabani, K. Hammoudi, H. Benhabiles, M. Melkemi, MaskedFace-Net–A dataset of correctly/incorrectly masked face images in the context of COVID-19, Smart Health 19 (2021).

    [4]

    N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov, S. Zagoruyko, End-to-end object detection with transformers, European conference on computer vision, 2020.

    [5]

    Cota, D. A. M. (2020). Monitoring COVID-19 prevention measures on CCTV cameras using deep learning. Politecnico di Torino.

    [6]

    N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), 2005.

    [7]

    H. Deng, J. Zhang, L. Chen, M. Cai, Improved mask wearing detection algorithm for SSD, Journal of Physics: Conference Series. (2021).

    [8]

    B.S.B. Dewantara, D.T. Rhamadhaningrum, Detecting multi-pose masked face using adaptive boosting and cascade classifier, 2020 International Electronics Symposium (IES), 2020.

    [9]

    X. Ding, X. Zhang, J. Han, G. Ding, Diverse branch block: Building a convolution as an inception-like unit, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.

    [10]

    X. Ding, X. Zhang, J. Han, G. Ding, Scaling up your kernels to 31x31: Revisiting large kernel design in cnns, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022.

    [11]

    X. Ding, X. Zhang, N. Ma, J. Han, G. Ding, J. Sun, Repvgg: Making vgg-style convnets great again, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021.

    [12]

    Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., & Gelly, S. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929.

    [13]

    T. Fang, X. Huang, J. Saniie, Design flow for real-time face mask detection using PYNQ system-on-chip platform, 2021 IEEE International Conference on Electro Information Technology (EIT), 2021.

    [14]

    C. Feng, Y. Zhong, Y. Gao, M.R. Scott, W. Huang, Tood: Task-aligned one-stage object detection, 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2021.

    [15]

    Z. Ge, S. Liu, Z. Li, O. Yoshie, J. Sun, Ota: Optimal transport assignment for object detection, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021.

    [16]

    Ge, Z., Liu, S., Wang, F., Li, Z., & Sun, J. (2021). Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430.

    [17]

    R. Girshick, Fast r-cnn, Proceedings of the IEEE international conference on computer vision, 2015.

    [18]

    K. Han, Y. Wang, Q. Tian, J. Guo, C. Xu, C. Xu, Ghostnet: More features from cheap operations, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020.

    [19]

    K. He, X. Zhang, S. Ren, J. Sun, Spatial pyramid pooling in deep convolutional networks for visual recognition, IEEE transactions on pattern analysis and machine intelligence 37 (9) (2015) 1904–1916.

    Digital Library

    [20]

    J. Hu, L. Shen, G. Sun, Squeeze-and-excitation networks, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018.

    [21]

    F. Isensee, P.F. Jaeger, S.A. Kohl, J. Petersen, K.H. Maier-Hein, nnU-Net: A self-configuring method for deep learning-based biomedical image segmentation, Nature methods 18 (2) (2021) 203–211.

    [22]

    X. Jiang, T. Gao, Z. Zhu, Y. Zhao, Real-time face mask detection method based on YOLOv3, Electronics 10 (7) (2021) 837.

    [23]

    Li, C., Li, L., Geng, Y., Jiang, H., Cheng, M., Zhang, B., Ke, Z., Xu, X., & Chu, X. (2023). Yolov6 v3. 0: A full-scale reloading. arXiv preprint arXiv:2301.05586.

    [24]

    Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., & Nie, W. (2022). YOLOv6: A single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976.

    [25]

    H. Lin, R. Tse, S.-K. Tang, Y. Chen, W. Ke, G. Pau, Near-realtime face mask wearing recognition based on deep learning, 2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC), 2021.

    [26]

    S. Lin, L. Cai, X. Lin, R. Ji, Masked face detection via a modified LeNet, Neurocomputing 218 (2016) 197–202.

    [27]

    S. Lin, T. Yu, R. Feng, X. Li, X. Yu, L. Xiao, Z. Chen, Local patch autoaugment with multi-agent collaboration, IEEE Transactions on Multimedia (2023).

    [28]

    T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, S. Belongie, Feature pyramid networks for object detection, Proceedings of the IEEE conference on computer vision and pattern recognition, 2017.

    [29]

    S. Liu, L. Qi, H. Qin, J. Shi, J. Jia, Path aggregation network for instance segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition, 2018.

    [30]

    Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., & Berg, A. C. (2016). Ssd: Single shot multibox detector. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14.

    [31]

    Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., & Guo, B. (2021). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE/CVF international conference on computer vision.

    [32]

    M. Loey, G. Manogaran, M.H.N. Taha, N.E.M. Khalifa, Fighting against COVID-19: A novel deep learning model based on YOLO-v2 with ResNet-50 for medical face mask detection, Sustainable cities and society 65 (2021).

    [33]

    F. Mercaldo, A. Santone, Transfer learning for mobile real-time face mask detection and localization, Journal of the American Medical Informatics Association 28 (7) (2021) 1548–1554.

    [34]

    Nieto-Rodriguez, A., Mucientes, M., & Brea, V. M. (2015). System for medical mask detection in the operating room through facial attributes. Pattern Recognition and Image Analysis: 7th Iberian Conference, IbPRIA 2015, Santiago de Compostela, Spain, June 17-19, 2015, Proceedings 7.

    [35]

    T. Ojala, M. Pietikäinen, D. Harwood, A comparative study of texture measures with classification based on featured distributions, Pattern recognition 29 (1) (1996) 51–59.

    [36]

    G. Oreski, YOLO* C—Adding context improves YOLO performance, Neurocomputing 555 (2023).

    [37]

    C. Papageorgiou, T. Poggio, A trainable system for object detection, International journal of computer vision 38 (2000) 15–33.

    [38]

    N. Petrović, Đ. Kocić, IoT-based system for COVID-19 indoor safety monitoring, IcETRAN Belgrade, 2020.

    [39]

    B. Qin, D. Li, Identifying facemask-wearing condition using image super-resolution with classification network to prevent COVID-19, Sensors 20 (18) (2020) 5236.

    [40]

    M. Razavi, H. Alikhani, V. Janfaza, B. Sadeghi, E. Alikhani, An automatic system to monitor the physical distance and face mask wearing of construction workers in COVID-19 pandemic, SN computer science 3 (2022) 1–8.

    [41]

    Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition.

    [42]

    Redmon, J., & Farhadi, A. (2017). YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition.

    [43]

    Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.

    [44]

    S. Ren, K. He, R. Girshick, J. Sun, Faster r-cnn: Towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems 28 (2015).

    Digital Library

    [45]

    Rudraraju, S. R., Suryadevara, N. K., & Negi, A. (2020). Face mask detection at the fog computing gateway. 2020 15th Conference on Computer Science and Information Systems (FedCSIS).

    [46]

    V. Sharma, Face mask detection using yolov5 for COVID-19, California State University San Marcos], 2020.

    [47]

    Somaldo, P., Ferdiansyah, F. A., Jati, G., & Jatmiko, W. (2020). Developing smart COVID-19 social distancing surveillance drone using YOLO implemented in robot operating system simulation environment. 2020 IEEE 8th R10 humanitarian technology conference (R10-HTC).

    [48]

    M. Sun, H. Zhang, Z. Huang, Y. Luo, Y. Li, Road infrared target detection with I-YOLO, IET Image Processing 16 (1) (2022) 92–101.

    [49]

    J.S. Talahua, J. Buele, P. Calvopiña, J. Varela-Aldás, Facial recognition system for people with and without face mask in times of the covid-19 pandemic, Sustainability 13 (12) (2021) 6900.

    [50]

    Tan, M., Pang, R., & Le, Q. V. (2020). Efficientdet: Scalable and efficient object detection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.

    [51]

    Tian, Z., Shen, C., Chen, H., & He, T. (2019). Fcos: Fully convolutional one-stage object detection. Proceedings of the IEEE/CVF international conference on computer vision.

    [52]

    Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE computer society conference on computer vision and pattern recognition. CVPR 2001.

    [53]

    B. Wang, Y. Zhao, C.P. Chen, Hybrid transfer learning and broad learning system for wearing mask detection in the COVID-19 era, IEEE Transactions on Instrumentation and Measurement 70 (2021) 1–12.

    [54]

    Wang, C.-Y., Bochkovskiy, A., & Liao, H.-Y. M. (2021). Scaled-yolov4: Scaling cross stage partial network. Proceedings of the IEEE/cvf conference on computer vision and pattern recognition.

    [55]

    Wang, C.-Y., Bochkovskiy, A., & Liao, H.-Y. M. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.

    [56]

    Wang, C.-Y., Liao, H.-Y. M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., & Yeh, I.-H. (2020). CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops.

    [57]

    Woo, S., Park, J., Lee, J.-Y., & Kweon, I. S. (2018). Cbam: Convolutional block attention module. Proceedings of the European conference on computer vision (ECCV).

    [58]

    L. Xu, W. Yan, J. Ji, The research of a novel WOG-YOLO algorithm for autonomous driving object detection, Scientific reports 13 (1) (2023) 3699.

    [59]

    Xu, X., Jiang, Y., Chen, W., Huang, Y., Zhang, Y., & Sun, X. (2022). Damo-yolo: A report on real-time object detection design. arXiv preprint arXiv:2211.15444.

    [60]

    Yang, G., Feng, W., Jin, J., Lei, Q., Li, X., Gui, G., & Wang, W. (2020). Face mask recognition system with YOLOV5 based on image recognition. 2020 IEEE 6th International Conference on Computer and Communications (ICCC).

    [61]

    Zereen, A. N., Corraya, S., Dailey, M. N., & Ekpanyapong, M. (2021). Two-stage facial mask detection model for indoor environments. Proceedings of International Conference on Trends in Computational and Cognitive Engineering: Proceedings of TCCE 2020.

    [62]

    Zhang, H., Cisse, M., Dauphin, Y. N., & Lopez-Paz, D. (2017). mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412.

    [63]

    Zhang, H., Li, F., Liu, S., Zhang, L., Su, H., Zhu, J., Ni, L. M., & Shum, H.-Y. (2022). Dino: Detr with improved denoising anchor boxes for end-to-end object detection. arXiv preprint arXiv:2203.03605.

    [64]

    J. Zhang, F. Han, Y. Chun, W. Chen, A novel detection framework about conditions of wearing face mask for helping control the spread of COVID-19, IEEE Access 9 (2021) 42975–42984.

    [65]

    Zhang, S., Chi, C., Yao, Y., Lei, Z., & Li, S. Z. (2020). Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.

    [66]

    Zhou, X., Wang, D., & Krähenbühl, P. (2019). Objects as points. arXiv preprint arXiv:1904.07850.

    [67]

    Y. Zhou, IYOLO-NL: An improved you only look once and none left object detector for real-time face mask detection, Heliyon 9 (8) (2023).

    [68]

    Zhu, B., Wang, J., Jiang, Z., Zong, F., Liu, S., Li, Z., & Sun, J. (2020). Autoassign: Differentiable label assignment for dense object detection. arXiv preprint arXiv:2007.03496.

    Index Terms

    1. A YOLO-NL object detector for real-time detection

      1. Computing methodologies

        1. Artificial intelligence

          1. Computer vision

            1. Computer vision problems

              1. Computer vision tasks

                1. Scene understanding

            2. Machine learning

          Index terms have been assigned to the content through auto-classification.

          Recommendations

          • SMD-YOLO: An efficient and lightweight detection method for mask wearing status during the COVID-19 pandemic

            Highlights

            • A face mask detection and monitoring system is developed by using an improved variant of YOLOv4-tiny.
            • A novel detector is proposed for mask wearing status in a complicated environment during the COVID-19 pandemic.
            • Anchor boxes of K-...

            Abstract Background and Objective

            At present, the COVID-19 epidemic is still spreading worldwide and wearing a mask in public areas is an effective way to prevent the spread of the respiratory virus. Although there are many deep learning methods used for ...

            Read More

          • YOLO-face: a real-time face detector

            Abstract

            Face detection is one of the important tasks of object detection. Typically detection is the first stage of pattern recognition and identity authentication. In recent years, deep learning-based algorithms in object detection have grown rapidly. ...

            Read More

          • Tiny Object Detector for Pulmonary Nodules based on YOLO

            ICDLT '23: Proceedings of the 2023 7th International Conference on Deep Learning Technologies

            Accurate detection and discovery of early lung cancer is the most effective measure to reduce lung cancer mortality with high clinical value. However, existing common object detectors show unsatisfactory detection accuracy for pulmonary nodule ...

            Read More

          Comments

          Information & Contributors

          Information

          Published In

          A YOLO-NL object detector for real-time detection (2)

          Expert Systems with Applications: An International Journal Volume 238, Issue PE

          Mar 2024

          1586 pages

          ISSN:0957-4174

          Issue’s Table of Contents

          Elsevier Ltd.

          Publisher

          Pergamon Press, Inc.

          United States

          Publication History

          Published: 27 February 2024

          Author Tags

          1. YOLO
          2. CSPNet
          3. Advanced label assignment
          4. Face mask detection
          5. Network design

          Qualifiers

          • Research-article

          Contributors

          A YOLO-NL object detector for real-time detection (3)

          Other Metrics

          View Article Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Total Citations

          • Total Downloads

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0

          Other Metrics

          View Author Metrics

          Citations

          View Options

          View options

          Get Access

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          Get this Publication

          Media

          Figures

          Other

          Tables

          A YOLO-NL object detector for real-time detection (2024)
          Top Articles
          Latest Posts
          Article information

          Author: Roderick King

          Last Updated:

          Views: 5425

          Rating: 4 / 5 (51 voted)

          Reviews: 82% of readers found this page helpful

          Author information

          Name: Roderick King

          Birthday: 1997-10-09

          Address: 3782 Madge Knoll, East Dudley, MA 63913

          Phone: +2521695290067

          Job: Customer Sales Coordinator

          Hobby: Gunsmithing, Embroidery, Parkour, Kitesurfing, Rock climbing, Sand art, Beekeeping

          Introduction: My name is Roderick King, I am a cute, splendid, excited, perfect, gentle, funny, vivacious person who loves writing and wants to share my knowledge and understanding with you.