123,123

基于卷积神经网络的图像分类模型综述*

电子技术应用 2023年9月

郭庆梅1，于恒力2，王中训1，刘宁波2

（1.烟台大学物理与电子信息学院，山东烟台 264005；2.海军航空大学信息融合研究所，山东烟台 264001）

摘要： 卷积神经网络在计算机视觉等领域占有一席之地，利用局部连接、权值共享以及池化操作等特性，有效地提取图像的局部特征，降低网络复杂度，具有更少的参数量和更好的鲁棒性，因此，吸引了众多研究者的关注，使分类模型朝着更轻、更快、更高效的方向迅速发展。按照卷积神经网络发展的时间线，介绍了常用的典型网络模型，剖析了其创新点与优缺点，并对其未来的发展方向进行了展望。

關(guān)鍵詞： 卷积神经网络计算机视觉特征提取分类模型

中圖分類號(hào)：TP183 文獻(xiàn)標(biāo)志碼：A DOI: 10.16157/j.issn.0258-7998.233909
中文引用格式： 郭慶梅，于恒力，王中訓(xùn)，等. 基于卷積神經(jīng)網(wǎng)絡(luò)的圖像分類模型綜述[J]. 電子技術(shù)應(yīng)用，2023，49(9)：31-38.
英文引用格式： Guo Qingmei，Yu Hengli，Wang Zhongxun，et al. Review of image classification models based on convolutional neural networks[J]. Application of Electronic Technique，2023，49(9)：31-38.

Review of image classification models based on convolutional neural networks

Guo Qingmei1，Yu Hengli2，Wang Zhongxun1，Liu Ningbo2

(1.School of Physics and Electronic Information， Yantai University， Yantai 264005， China； 2.Information Fusion Institute， Naval Aviation University， Yantai 264001， China)

Abstract： Convolutional neural networks have established themselves as a prominent technique in computer vision and related fields. By leveraging features such as local connections, weight sharing, and pooling operations, these networks are able to effectively extract local features from images, reducing network complexity, and exhibiting fewer parameters and greater robustness. As a result, they have garnered significant attention from researchers and have led to the rapid development of classification models that are lighter, faster, and more efficient. This article presents a timeline of typical network models used in convolutional neural network development, analyzes their innovative points and advantages and disadvantages, and offers insights into their future development directions.

Key words : convolutional neural network；computer vision；feature extraction；classification model

0　引言

卷積神經(jīng)網(wǎng)絡(luò)[1]是一種深度學(xué)習(xí)模型，主要應(yīng)用于圖像和視頻等數(shù)據(jù)的識(shí)別與分類。2012年Alex Krizhevsky等人[2]在ImageNet大賽中使用CNN大幅度超越傳統(tǒng)方法，CNN一躍成為計(jì)算機(jī)視覺領(lǐng)域的熱門技術(shù)。其具有表征學(xué)習(xí)能力、泛化能力以及平移不變性，可以高效處理大規(guī)模圖像且能夠轉(zhuǎn)換成圖像結(jié)構(gòu)的數(shù)據(jù)，解決了傳統(tǒng)方法需手動(dòng)提取特征帶來的耗時(shí)、準(zhǔn)確率低等問題，加之計(jì)算機(jī)性能有了很大的提升[3]，使得CNN得到了質(zhì)的發(fā)展，因此在圖像分類、目標(biāo)識(shí)別以及醫(yī)療診斷等領(lǐng)域被廣泛應(yīng)用[4]，且取得了顯著的成就。

本文詳細(xì)內(nèi)容請(qǐng)下載：http://m.ihrv.cn/resource/share/2000005634

作者信息：

郭慶梅1，于恒力2，王中訓(xùn)1，劉寧波2

（1.煙臺(tái)大學(xué) 物理與電子信息學(xué)院，山東煙臺(tái) 264005；2.海軍航空大學(xué) 信息融合研究所，山東煙臺(tái) 264001）

微信圖片_20210517164139.jpg

原創(chuàng)聲明：此內(nèi)容為AET網(wǎng)站原創(chuàng)，未經(jīng)授權(quán)禁止轉(zhuǎn)載。

相關(guān)內(nèi)容