[Paper] MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
This is a review of the paper "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications".
The original paper can be found at the link.
Depthwise separable convolution (DSC) operation
Notation: M (number of input channels), N (number of output channels), D_K (kernel size), D_F (input feature-map size), D_G (output feature-map size).
- A standard convolution maps (D_F, D_F, M) -> (D_G, D_G, N) in one step; DSC splits this into a depthwise step (D_F, D_F, M) -> (D_G, D_G, M) followed by a pointwise step (D_G, D_G, M) -> (D_G, D_G, N)
- This cuts the multiply-add cost from N · D_G² · D_K² · M (standard) down to M · D_G² · D_K² + N · D_G² · M = M · D_G² · (D_K² + N)
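The saving can be checked numerically; a minimal sketch of the cost formulas above (the layer sizes are illustrative, not taken from the paper):

```python
# Multiply-add cost of a standard convolution vs. a depthwise
# separable convolution (DSC), using the notation above.

def standard_cost(dk, m, n, dg):
    # one (dk x dk x m) filter per output channel, applied at dg x dg positions
    return n * dg * dg * dk * dk * m

def dsc_cost(dk, m, n, dg):
    depthwise = m * dg * dg * dk * dk   # one dk x dk filter per input channel
    pointwise = n * dg * dg * m         # 1x1 convolution mixing channels
    return depthwise + pointwise

# Illustrative layer: 3x3 kernels, 512 -> 512 channels, 14x14 output map
dk, m, n, dg = 3, 512, 512, 14
ratio = dsc_cost(dk, m, n, dg) / standard_cost(dk, m, n, dg)
print(ratio)  # equals 1/n + 1/dk**2, roughly 0.113 here
```

The ratio simplifies to 1/N + 1/D_K², so with 3×3 kernels the cost drops to a bit over one ninth of the standard convolution.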
Small Deep Neural Network
Channel reduction
Computational Complexity : eg. DSC
Number of parameters : replace 3×3 kernels with 1×1 kernels / remove the FC layer
Down sampling
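The parameter-count items in the list above can be made concrete; a small sketch (channel counts are illustrative, and the 7×7×1024 feature map assumes a MobileNet-style backbone with 1000 classes):

```python
# Parameter counts for two of the reduction tricks listed above:
# replacing 3x3 kernels with 1x1 kernels, and removing the FC layer.

def conv_params(k, m, n):
    # k x k kernel, m input channels, n output channels (bias omitted)
    return k * k * m * n

m, n = 256, 256
p3 = conv_params(3, m, n)   # 3x3 convolution
p1 = conv_params(1, m, n)   # 1x1 convolution
print(p3 // p1)             # 9x fewer parameters

# Removing the big FC layer: global average pooling shrinks a
# 7x7x1024 feature map to 1024 values before the 1000-way classifier.
fc_on_flat = 7 * 7 * 1024 * 1000   # FC on the flattened feature map
fc_on_gap = 1024 * 1000            # FC after global average pooling
print(fc_on_flat // fc_on_gap)     # 49x fewer parameters
```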
- The key idea MobileNet builds on is DSC
Key
: Depth-wise separable convolutions, plus hyper-parameters that trade off latency against accuracy
- Related Work : Compressing pretrained network
Product quantization / hashing / pruning / vector quantization / Huffman coding / distillation
- Related Work: Training small network
MobileNets with resource restrictions (latency & size)
- Latency -> Depth-wise separable convolution(DSC)
- Size -> Hyper-parameters α, ρ
Others: Flattened network / Factorized Network / Xception network / Squeezenet
Architecture
- DSC except for the first layer
- Batch-norm & ReLU except for FC layer
- Down-sampling handled with strided convolutions (no pooling layers)
Nearly all of the computation sits in dense 1×1 convolutions, which can be implemented directly with highly optimized general matrix multiply (GEMM) routines
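The forward pass of one DSC block described above can be sketched in plain NumPy (shapes, stride-1 "valid" padding, and the random inputs are illustrative; batch-norm and ReLU are omitted):

```python
import numpy as np

def depthwise_separable_conv(x, dw_filters, pw_filters):
    """Depthwise then pointwise convolution (stride 1, no padding).

    x:          (H, W, M) input feature map
    dw_filters: (K, K, M) one K x K filter per input channel
    pw_filters: (M, N)    1x1 convolution mixing M channels into N
    """
    h, w, m = x.shape
    k = dw_filters.shape[0]
    ho, wo = h - k + 1, w - k + 1

    # Depthwise step: each channel is filtered independently.
    dw_out = np.zeros((ho, wo, m))
    for i in range(ho):
        for j in range(wo):
            patch = x[i:i + k, j:j + k, :]              # (K, K, M)
            dw_out[i, j] = (patch * dw_filters).sum(axis=(0, 1))

    # Pointwise step: a 1x1 convolution is a matrix multiply over channels.
    return dw_out @ pw_filters                          # (Ho, Wo, N)

rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8, 4))
out = depthwise_separable_conv(x,
                               rng.standard_normal((3, 3, 4)),
                               rng.standard_normal((4, 6)))
print(out.shape)  # (6, 6, 6)
```

The pointwise step being a plain matrix multiply is exactly why the 1×1 convolutions map so well onto GEMM.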
Width Multiplier α: Thinner
Uses fewer channels in each layer
Usually 1 / 0.75 / 0.5 / 0.25
Resolution Multiplier ρ: Reduced representation
Usually 224(Original Resolution) / 192 / 160 / 128
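How α and ρ scale the cost of one DSC layer can be sketched by extending the cost formula from the DSC section (layer sizes are illustrative):

```python
# Cost of one DSC layer under the width multiplier alpha and the
# resolution multiplier rho (same notation as the DSC section).

def dsc_cost(dk, m, n, dg, alpha=1.0, rho=1.0):
    m_a, n_a = alpha * m, alpha * n     # alpha thins every layer
    dg_r = rho * dg                     # rho shrinks the feature map
    return m_a * dg_r**2 * dk**2 + n_a * dg_r**2 * m_a

base = dsc_cost(3, 512, 512, 14)
thin = dsc_cost(3, 512, 512, 14, alpha=0.5)
small = dsc_cost(3, 512, 512, 14, rho=128 / 224)

print(thin / base)    # ~0.25: the dominant pointwise term scales with alpha^2
print(small / base)   # exactly rho^2 ~ 0.33: cost is quadratic in rho
```

Both multipliers reduce cost roughly quadratically, which is why small changes in α or ρ buy large computation savings.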
Config
- Asynchronous gradient descent
- Less regularization and data augmentation
- Little or no weight decay on the depthwise filters (they have very few parameters)
Insight
- DSC vs full convolution
- Accuracy is comparable at a fraction of the cost
- Deep & Thin model is better than shallow model
- Making the network thinner is more effective than making it shallower (removing layers)
- Multiplier (α ,ρ)
- Trade-off between accuracy and (computation, number of parameters)
- In particular, accuracy falls off log-linearly with computation, with a sharper drop at α = 0.25