FASTER CONVOLUTIONAL NEURAL NETWORKS TRAINING

Doctoral Candidate Name:

Shanshan Jiang

Program:

Computing and Information Systems

Abstract:

Convolutional Neural Network (CNN) models have become the mainstream method in Artificial Intelligence (AI) areas for computer vision tasks like image classification and image segmentation. Deep CNNs contain a large volume of convolution calculations. Training a large CNN may take days or even weeks, which is time-consuming and costly. When we need multiple runs to search for the optimal CNN hypermeter settings, it would take a couple of months with limited GPUs, which is not acceptable and hinders the development of CNNs. It is essential to train CNN faster.

We have proposed a novel Conditional Reduction (CR) module to compress a single 1×1 convolution layer. Then we have developed a novel three-layer Conditional block (C-block) to compress the CNN bottleneck or inverted bottlenecks. At last we have developed a novel Conditional Network (CRnet) based on the CR module and C-block. We have tested the CRnet on two image classification datasets: CIFAR-10 and CIFAR-100, with multiple network expansion ratios and compression ratios. The experiments verify our methods’ correctness with attention to the importance of the input-output pattern when selecting a compression strategy. The experiments show that our proposed CRnet better balances the model complexity and accuracy compared to the state-of-the-art group convolution and Ghost Module compression.

We have proposed a flat reduced random sampling training strategy and a bottleneck reduced random sampling strategy. We have proposed a three-stage training method based on the bottleneck reduced random sampling. Furthermore, we have proved the data visibility of a sample in the whole training process and the theoretical reduced time by four theorems and two corollaries. We have tested the two sampling methods on three image classification datasets: CIFAR-10, CIFAR-100 and ImageNet. The experiments show that our proposed two sampling strategies effectively reduce a significant training time percentage at a very small accuracy loss.

Defense Date and Time:

Thursday, June 17, 2021 - 1:00pm

Defense Location:

Zoom Meeting ID: 999 7040 7659 Passcode: 834935

Committee Chair's Name:

Dr. Sheng-Guo Wang

Committee Members:

Dr. Aidong Lu, Dr. Bei-Tseng Chu, Dr. Shen-en Chen

My UNC Charlotte

Directory

Campus Events

Library

Prospective Students

Faculty and Staff

Current Students

Alumni and Friends

FASTER CONVOLUTIONAL NEURAL NETWORKS TRAINING